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Modeling  Fault  Diagnosis  Performance 
on  a  Marine  Powerplant  Simulator 
Yuan-Liang  D.  Su 
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Directed  by  Dr.  T.  Govindaraj 

Human  operators  of  large  dynamic  systems  must  possess  good  fault 
diagnosis  skills.  Factors  affecting  fault  diagnosis  skills  are  investi¬ 
gated  in  this  thesis,  and  a  model  for  fault  diagnosis  is  developed. 

A  low  fidelity  simulator  of  a  marine  powerplant,  was  developed 
and  used  to  study  expert  marine  engineers'  fault  diagnosis  performance. 
Based  cc  the  data  collected,  factors  affecting  the  fault  diagnosis  per¬ 
formance  were  identified.  They  are:  the  initial  feasible  set  (IFS)  and 
the  transition  strategy.  A  subject's  initial  set  of  actions  upon 
observing  the  symptoms  forms  the  IFS  which  reflects  the  subjects' 
knowledge  about  a  given  failure.  Transition  strategy  is  concerned  with 
the  schemes  subjects  used  to  shift  from  the  hypothesis  formation  stage 
to  hypothesis  evaluation  stage.  Two  types  of  strategy  were  identified: 
breadth-depth  strategy  and  balanced  strategy.  The  data  seem  to  indicate 
that  subjects  who  had  good  IFS  and  used  breadth-depth  strategy  performed 
better  than  subjects  who  had  bad  IFS  and  used  balanced  strategy.  This 
finding  seems  to  imply  that  both  context-specific  (IFS)  and  context-free 
(transition  strategy)  factors  are  important  for  fault  diagnosis  train¬ 
ing. 

A  model  for  the  fault  diagnosis  process  which  employs  a  concep¬ 
tual  entity  called  hypothesis  frame  to  account  for  the  observations  from 


data  and  protocols  has  been  proposed.  Relevant  information  about  a 
failure  is  "compiled"  in  the  frames  so  that  subjects  could  make  quick 
and  reliable  inferences.  Given  symptoms  of  a  failure,  subjects  would 
first  try  to  match  a  frame.  Once  a  frame  is  identified,  subjects  could 
use  the  information  contained  in  the  slots  to  make  inferences.  When  no 
obvious  frames  could  be  processed,  subjects  could  search  in  the  system 
or  symptom  knowledge  base  for  relevant  information  under  the  guidance  of 
heuristics.  This  model  could  form  the  basis  for  implementing  an  on-line 
coach  system  for  fault  diagnosis. 


CHAPTER  I 


INTRODUCTION 


Humans  encounter  various  types  of  problems  everyday.  Some  of 
these  difficult  problems  require  the  human  to  have  advanced  skills  and 
training  in  order  to  be  able  to  solve  them.  The  quality  of  the  solu¬ 
tions  and  the  rapidity  with  which  the  problems  are  solved  depend  very 
much  on  the  competence  of  the  problem  solvers.  Therefore,  it  is 
interesting  and  useful  to  find  out  what  type  of  skills  a  problem  solver 
should  possess  to  solve  problems  successfully. 

Human  problem  solving  has  been  studied  from  several  different 
perspectives.  Within  the  past  few  decades,  mathematicians  (Polya  1945), 
computer  scientists  (Newell  and  Simon  1972,  Shortliffe  1976,  Amarel 
1983,  Davis  1983),  cognitive  psychologists  (Tversky  and  Kahneman  1975, 
Anderson  1980)  and  system  engineers  (Duncan  and  Shepherd  1975,  Rouse 
1981,  1983,  Rouse  and  Hunt  1984)  have  contributed  substantially  to 
understanding  of  human  problem  solving.  Due  to  the  complexity  of  real¬ 
istic  problems,  most  of  the  prooiems  studied  in  a  controileu  .aooratory 
environment  were  simple  game-type  problems.  There  are  many  remaining 
questions  regarding  how  results  from  this  previous  research  can  be 
applied  to  real  problem  solving  environments  such  as  troubleshooting  in 
a  large  complex  engineering  system. 

The  human  operator's  role  in  the  operation  of  large  complex  sys¬ 
tems  has  become  increasingly  supervisory  and  managerial,  requiring 
higher  level  perceptual  and  cognitive  skills.  Operator  intervention 


occurs  when  an  undesirable  or  unforseen  event  requires  actions  to 
detect,  diagnose  and  compensate  for  failures,  or  dynamic  conditions. 
Therefore,  the  operation  of  a  large  system  sometimes  depends  on  the 
problem  solving  skills  of  the  operators.  This  type  of  problem  solving 
is  usually  referred  to  as  fault  diagnosis  in  the  literature  (Duncan  and 
Shepherd  1975,  Rouse  and  Hunt  1984). 

Fault  diagnosis  is  a  complex  activity,  requiring  a  considerable 
amount  of  practice  and  experience  before  one  becomes  proficient.  Simu¬ 
lators  are  normally  used  to  train  people  to  improve  their  fault  diag¬ 
nosis  performance.  One  objective  of  this  research  is  to  identify  the 
skills  required  and  factors  affecting  fault  diagnosis  performance,  so 
that  training  simulators  can  be  designed  for  specific  operational 
environments . 

In  addition  to  training  on  actual  systems  or  simulators,  fault 
diagnosis  performance  of  operators  can  be  improved  considerably  by  pro¬ 
viding  advice  or  aid  at  appropriate  times.  An  effective  computer  aid 
can  function  as  an  on-line  coach  for  troubleshooting  oy  monitoring  sys¬ 
tem  states  and  processing  information  just  like  a  auman  operator  would 
in  a  fault  diagnosis  situation.  A  good  model  of  human  operator  fault 
diagnosis  performance  is  necessary  to  design  an  aid.  The  model  must 
incorporate  all  relevant  symptom,  system  and  failure  knowledge  into  a 
coherent  structure.  This  model  must  also  be  able  to  reason  and  diagnose 
like  a  domain  expert.  A  major  problem  in  developing  such  a  model  is  to 
determine  what  types  of  knowledge  should  be  included  and  how  they  should 
be  represented.  In  this  thesis  a  model  for  fault  diagnosis  is  proposed 


as  a  preliminary  step  for  building  of  an  on-line  coach. 

In  order  to  achieve  the  objectives  discussed  above,  problem  solv¬ 
ing  performance  of  mar'ne  engineer  trainees  was  studied  via  a  series  of 
experiments  conducted  at  Marine  Safety  International  (MSI),  in  New  York. 
The  trainees,  who  were  subjects  in  our  experiments,  were  participants  in 
a  training  course  designed  to  teach  marine  engineers  with  10  to  20  years 
of  experience  how  to  diagnose  and  compensate  for  powerplant  failures  in 
a  steam-powered  ship.  During  the  training,  realistic  powerplant 
failures  compiled  by  expert  marine  engineers  were  used  on  simulators  of 
different  fidelity  levels.  Their  problem  solving  performance  was  care¬ 
fully  monitored.  Factors  affecting  fault  diagnosis  are  identified  and 
tested.  Also  a  process  model  of  fault  diagnosis  is  proposed  and  used  to 
interpret  protocols  collected. 

Simulators  of  a  marine  powerplant  were  used  to  create  a  realistic 
atmosphere  so  that  "real"  fault  -diagnosis  behavior  were  observed.  If  a 
simulator  resembles  a  real  ship  in  providing  an  environment  for 
failures,  the  behavior  of  trainees  should  be  reasonably  realistic.  How¬ 
ever,  resemblance  or  realism  is  not  easy  to  measure.  Conventionally, 
transfer  of  training  experiments  are  conducted  to  achieve  this  purpose. 
But  transfer  experiments  are  difficult,  expensive  and  sometimes  impossi¬ 
ble  to  conduct  (Adams  1979,  Duncan  and  Shepherd  1975).  Therefore, 
instead  of  conducting  a  transfer  of  training  study,  we  used  a  descrip¬ 
tive  approach.  We  analyzed  the  factors  affecting  the  "realism"  or 
"simulator  fidelity"  and  investigated  how  simulator  fidelity  interacts 
with  training  effectiveness.  The  key  factors  that  affect  training 


effectiveness  were  incorporated  into  the  simulators.  Chapter  II 
describes  this  effort.  An  in-depth  survey  of  simulator  fidelity  is  con¬ 
ducted,  the  definitions,  relationship  to  training  and  the  components  of 
fidelity  are  discussed. 

Based  on  these  discussions,  a  definition  of  fidelity  was  adopted. 
Simulators  of  high,  moderate  and  low  fidelity  levels  are  described  in 
Chapter  III.  It  is  then  argued  that,  for  training  of  cognitive  behavior 
such  as  fault  diagnosis,  a  low  fidelity  simulator  is  reasonably  ade¬ 
quate.  The  description  of  the  low  fidelity  simulator  then  follows. 

This  low  fidelity  simulator  was  used  as  a  vehicle  to  study  the 
fault  diagnosis  behavior  of  experts  on  marine  powerplant  failures. 
Experiments  were  conducted  using  this  simulator.  Chapter  IV  describes 
the  experiments,  including  a  description  of  the  subjects,  the  experimen¬ 
tal  procedures,  and  the  data  collection  technique. 

Chapter  V  proposes  a  framework  of  fault  diagnosis  based  on  obser¬ 
vations  from  experiments.  Factors  affecting  fault  diagnosis  in  the  con¬ 
text  of  the  framework  proposed  are  derived  and  verified.  The  relation¬ 
ship  between  these  factors  and  the  performance  was  found  to  be  statisti¬ 
cally  significant. 

Although  experimental  results  were  statistically  significant,  the 
individual  differences  in  diagnosis  performance  were  striking.  In  order 
to  probe  this  problem  further,  the  relationship  between  the  structure  of 
the  system  and  symptom  knowledge  and  the  mechanism  of  the  fault  diagnos¬ 
tic  process  were  investigated.  In  Chapter  VI,  a  model  which  is 


abstracted  from  data  and  protocols  collected  is  then  proposed  to  account 
for  the  relationship  between  the  knowledge  structures  and  the  diagnostic 
process. 

Finally  Chapter  VII  summarizes  the  findings  of  this  research  and 
discusses  the  implications  for  training  and  simulator  development. 
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CHAPTER  II 

SIMULATOR  FIDELITY 

Traditional  approaches  to  training  on  actual  equipment  are  becom¬ 
ing  more  and  more  prohibitive  because  of  relatively  high  cost  and  their 
limited  ability  to  be  used  for  training  on  unusual  or  potentially  catas¬ 
trophic  situations.  Simulators  are  used  to  cope  both  with  increasing 
costs  and  limitations  on  training  effectiveness.  Spangenberg  (1976) 
discusses  seven  unique  advantages  of  using  simulators  for  training. 
Simulators  can  (1)  provide  immediate  feedback,  (2)  increase  the  number 
of  system  malfunctions  and  emergencies  to  provide  the  trainee  with 
experience  which  would  be  unavailable  on  actual  equipment,  (3)  compress 
time  so  a  complex  sequence  of  tasks  may  be  accomplished  in  the  time  it 
would  take  to  run  through  only  one  or  two  tasks  on  the  actual  equipment, 
(4)  vary  the  sequence  of  tasks  to  maximize  training  efficiency,  (5)  pro¬ 
vide  guidance  and  stimul »«  support  to  the  trainee  in  the  form  of  prompts 
and  feedback,  (6)  vary  the  difficulty  level  to  match  the  skill  level  of 
each  individual  trainee,  and  v7)  provide  the  trainee  with  an  overview 
from  which  the  trainee  may  form  an  overall  understanding  of  the  whole 
situation.  These  advantages,  in  addition  to  the  potential  cost- 
effectiveness  are  the  reasons  why  simulators  have  been  widely  used. 

Simulators  take  various  forms.  These  include,  as  examples, 
mock— photographic  mimics,  vedio  disc  and  computer  simulations. 
Usually  simulators  are  less  expensive  than  real  systems.  However,  large 
mock-ups  like  those  used  to  train  pilots  are  expensive.  The  cost  of  a 
simulator  usually  increases  with  the  fidelity  of  the  simulator  even 


though  increased  fidelity  does  not  guarantee  better  training. 

Overview  of  Simulator  Training 

A  "simulator"  is  a  device  or  a  facility  which  represents  a 
machine,  system,  or  environment  and  its  functions  (Gerathewohl  1969). 
Simulators  have  been  widely  used  to  train  operators  for  maintenance, 
normal  operations,  problem-solving,  and  decision  making.  Simulators 
have  been  constructed  for  a  variety  of  applications.  Clymer  (1980) 
identified  at  least  eight  different  types  :  aircraft,  aerospace,  marine, 
ground  vehicle,  traffic,  process  plant,  power  plant,  and  manufacturing 
plant . 

There  have  been  numerous  research  efforts  to  evaluate  the  effec¬ 
tiveness  of  simulation  for  training.  The  extent  to  which  a  given  simu¬ 
lator  facilitates  the  acquisition  of  appropriate  skills  by  the  trainees 
is  characterized  by  "transfer  of  training"  from  the  training  devices  to 
actual  equipment.  This  is  called  the  "training  effect"  or  "training 
effectiveness" . 

Training  simulators  may  consist  of  several  subsystems  which 
interact  with  each  other.  Since  each  subsystem  may  contain  hundreds  of 
indicators  and  gauges,  it  can  be  expensive  to  construct  and  run  such  a 
simulator.  Therefore,  the  question  of  how  to  utilize  simulators  effi¬ 
ciently  becomes  important. 

Conventionally,  simulators  have  been  employed  in  training  with 
the  assumption  that  higher  fidelity  produces  a  better  transfer  effect. 
However,  research  that  contradicts  this  assumption  has  also  been 
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reported  during  the  past  few  years  (Johnson  1981,  Johnson  &  Rouse  1982, 
Trollip  1977).  The  rest  of  this  chapter  discusses  the  general  issue  of 
simulator  fidelity.  Its  definition,  relationship  with  training,  meas¬ 
urement  and  components  are  described. 

Definitions  of  Fidelity 

"Fidelity"  has  been  used  widely  and  diversely  in  the  simulator 
training  community.  Different  people  use  the  term  with  different  mean¬ 
ings.  Hays  (1981)  reviewed  the  literature  and  noted  the  diversity  of 
meaning.  He  found  that  most  researchers  contrasted  physical  fidelity 
with  non-physical  fidelity.  It  is  non-physical  fidelity  that  attracted 
a  variety  of  names  and  definitions.  Functional  fidelity,  psychological 
fidelity,  task  fidelity  and  behavioral  fidelity  (Hays  1980)  are  among 
the  names  used.  In  general,  most  researchers  agree  that  physical  fidel¬ 
ity  is  not  the  only  factor,  nor  the  main  factor,  affecting  training 
effectiveness.  There  is  also  general  agreement  that  higher 
fidelity  (assuming  it  can  be  measured)  is  not  necessary  for  every  aspect 
of  every  kind  of  training  (Hays  1981). 

There  appears  to  be  a  lack  of  consensus  in  the  research  on  simu¬ 
lator  fidelity,  and  of  an  appropriate  definition  of  what  is  meant  by 
fidelity.  After  reviewing  several  attempts  to  define  simulator  fidel¬ 
ity,  Hays  (1981)  proposed  the  following  definition  : 

Training  simulator  fidelity  is  the  degree  of  similarity 
between  the  training  simulator  and  the  equipment  which  is 
simulated.  It  is  a  two  dimensional  measurement  of  this  simi- 


larity  in  terms  of:  the  physical  characteristics  of  the  train¬ 
ing  simulator,  and  the  functional  characteristics  of  the  simu¬ 
lated  equipment. 


Rouse  (1982)  defined  fidelity: 

"the  precision  with  which  the  simulator  reproduces  the  appear¬ 
ance  and  behavior  of  the  real  equipment." 

These  two  definitions  are  very  similar.  They  emphasize  that  fidelity  is 
a  two  dimensional  concept.  They  also  pointed  out  the  measurement  prob¬ 
lems.  Tasks  and  the  responses  of  the  trainees  were  not  explicitly  con¬ 
sidered  . 


According  to  Kinkade  and  Wheaton  (1972),  the  fidelity  of  a  simu¬ 
lator  consists  of  three  different  components:  (1)  equipment  fidelity  (2) 
environmental  fidelity,  and  (3)  psychological  fidelity.  Equipment 
fidelity  is  defined  as  the  degree  to  which  the  simulator  duplicates  the 
appearance  and  "feel"  of  the  real  system.  Environmental  fidelity  is 
concerned  witn  :ne  cegree  to  which  the  simulator  duplicates  the  sensory 
stimulation,  e.g..  dynamic  motion  cues,  visual  sues,  etc.  Psycno logical 
fidelity  is  simply  the  degree  to  which  the  trainee  perceives  the  simula¬ 
tor  as  a  duplicate  of  the  real  system.  Equipment  fidelity  is  actually 
what  Hays  defined  as  physical  fidelity,  while  the  environmental  fidelity 
and  the  psychological  fidelity  together  approximate  his  functional 
fidelity.  However,  psychological  fidelity  explicitly  recognizes  the 
role  of  the  trainees'  perception  of  fidelity. 


Govindaraj  (1983)  proposed  a  three-dimensional  approach  in  which 


further  descriptions  and  measurements  along  each  dimension  are  dis¬ 
cussed  . 

(1)  Physical  fidelity  : 

Physical  fidelity  is  concerned  with  the  variables 
presented  and  the  forms  they  take  as  well  as  the  environ¬ 
mental  factors  such  as  noise,  vibration  and  thermal  condi¬ 
tions.  Techniques  from  syntactic  pattern  recognition  are 
suggested  to  measure  physical  fidelity. 

(2)  Structural  fidelity  : 

Structural  fidelity  refers  to  the  relationships  between 
subsystems.  Level  of  abstraction,  coupling  of  system 
states,  and  aggregation  of  subsystems  are  the  primary  con¬ 
cerns.  Graph  theoretic  methods  are  suggested  for  measure¬ 
ment  . 

(3)  Dynamic  fidelity  : 

Dynamic  fidelity  refers  to  the  evolution  of  system  states 
over  time  and  their  presentation  to  trainees.  Concroi 
theoretic  methods  are  suggested  for  measurement. 

This  definition  appears  to  be  relatively  comprehensive  and  especially 
useful  for  describing  the  fidelity  of  simulators  of  large  complex  sys¬ 
tems  such  as  power  plant  control  rooms.  Non-physical  fidelity  is  decom¬ 
posed  into  structural  fidelity  and  dynamic  fidelity.  This  provides  a 
way  to  analyze  and  measure  the  functional  aspects  of  a  simulator. 

Despite  the  rigorous  attempts  to  define  simulator  fidelity,  one 
must  keep  in  mind  that  training  effectiveness  is  the  main  concern.  If 


high  fidelity  does  not  imply  high  transfer  of  training,  then  fidelity  is 
not  a  useful  concept.  As  pointed  out  by  Rouse  (1982),  the  key  issue  in 
the  use  of  simulators  is  the  level  of  fidelity  necessary  to  assure 
transfer  of  training  from  simulators  to  real  equipment.  The  study  of 
simulator  fidelity  can  help  clarify  the  following  questions. 

1) .  What  are  the  variables  affecting  the  feeling  of  realism? 

2) .  What  is  learned  and  in  what  way? 

3) .  Can  criteria  for  simulator  design  be  found?  What  should 

the  fidelity  level  of  a  training  simulator  be  to  achieve  a 
given  goal? 

4) .  What  is  the  relationship  between  each  dimension  of  fidel¬ 

ity  and  transfer  of  training  ?  Or,  does  any  meaningful 
relationship  exist  between  these  two? 

A  definition  of  fidelity  is  necessary  if  any  further  study  of 
fidelity  is  anticipated.  It  may  not  be  possible  to  have  a  general  index 
of  fidelity  for  design  purposes.  Nevertheless,  an  explicitly  expressed 
and  commonly  accepted  definition  is  required  for  comparison  of  fidelity 
between  different  simulators. 


Relationship  With  Training 


A  hypothetical  relationship  among  fidelity,  transfer,  and  cost 
was  proposed  by  R.  B.  Miller  (1954)  (see  Figure  2-1).  Very  little 
empirical  data  have  been  collected  to  explore  this  relationship. 
According  to  Miller,  an  increase  in  the  degree  of  simulator  fidelity  is 
accompanied  by  increases  in  both  transfer  of  training  and  cost.  The 


"490  point  of  very 

diminishing  expensive 

f  af  nrne* 


Figure  2-1  :  The  hypothetical  relationship  among  fidelity, 
transfer  and  cost  (modified  from  Miller  1954) 


objective,  both  for  simulate,  resign  and  the  use  of  a  simulator  for 
training,  is  to  find  the  optimal  point  of  intersection  between  fidelity, 
transfer  and  cost  in  each  case>  One  problem  with  Miller's  formulation 
is  that  the  cost  of  a  simulator  could  go  to  infinity  as  its  fidelity 


increases  (Orlansky  1984).  Another  problem  is  the  explicit  assumption 
that  the  amount  of  transfer  increases  with  increasing  fidelity  of  the 
simulator  (Micheli  1972).  Many  researchers  have  found  that  comparable 
training  results  may  be  obtained  with  both  low-  and  high-fidelity  simu¬ 
lators  of  the  same  equipment  (Duncan  and  Shepherd  1975,  Crawford  and 
Crawford  1978,  Johnson  1981).  In  a  study  by  Martin  and  Waag  (1978),  it 
was  shown  that  flight  simulators  with  higher  fidelity  provided  too  much 
information  for  novice  trainees  and  actually  detracted  from  simulator 
effectiveness.  Prophet  (1966)  reported  a  study  that  compared  a  low 
fidelity  simulator  (inexpensive  photographic  mock-up  of  a  cockpit)  with 
that  of  an  elaborate  trainer.  No  significant  difference  between  groups 
was  found.  Despite  these  counterexamples,  Miller's  approach  is  cited 
widely  (Fink  and  Shriver  1978,  Kinkade  and  Wheaton  1972,  Hays  1981). 

A  reformulation  of  Miller's  view  has  been  proposed  by  Orlansky 
(1984).  Even  though  Orlansky 's  hypothetical  model  is  not  fully  sup¬ 
ported  by  empirical  data,  the  known  facts  about  the  cost  of  simulators 
and  ibouc  the  relationship  between  transfer  jf  training  ind  fidelity 
have  been  accounted  for  in  the  model. 

Kinkade  and  Wheaton  (1972)  have  proposed  a  hypothetical  relation¬ 
ship  between  the  degree  of  simulator  fidelity,  types  of  simulator  fidel¬ 
ity  and  the  stages  of  learning  (see  Figure  2-2).  Early  in  the  training 
program  (procedure  training),  the  trainee  does  not  benefit  from  high 
degrees  of  either  physical  or  environmental  fidelity.  However,  as  skill 
is  acquired  (familiarization  training),  there  are  requirements  for 
increases  in  both  physical  and  environmental  fidelity,  with  the  require- 
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Figure  2-2  :  The  hypotneticai  relationship  among  degree  or 
simulation  (fidelity)  and  stage  of  learning 
(Kinkade  and  Wheaton,  1972) 


ments  for  greater  environmental  fidelity  increasing  at  a  faster  rate. 
During  later  stages  of  training  (skill  training),  increases  in  both 
types  of  fidelity  are  desirable,  with  a  requirement  for  higher  levels  of 
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functional  fidelity. 


Johnson  (1981)  was  able  to  show  that  high  fidelity  is  not 
required  for  training  in  procedural  tasks.  Several  other  researchers 
(Johnson  and  Rouse  1982,  Johnson  and  Fath  1984,  Maddox,  Johnson  and  Frey 
1985)  reported  similar  results  for  fault  diagnosis  tasks.  Govindaraj 
(1983)  also  casts  doubt  on  the  necessity  of  high  physical  fidelity  for 
problem  solving  training.  Baum  (1981)  pointed  out  that  empirical  data 
to  support  Kinkade  and  Wheaton's  conjecture  are  lacking  except  those  for 
procedure  training.  Baum,  Riedel  and  Hays  (1982)  conducted  a  study  to 
determine  the  relationship  between  training  device  fidelity  and  transfer 
of  training  for  a  perceptual-motor  maintenance  task.  The  results  indi¬ 
cate  that  physical  similarity  is  a  significantly  more  important  deter¬ 
minant  of  skill  acquisition  than  functional  similarity.  These  experi¬ 
ments  provide  some  support  for  Kinkade  and  Wheaton's  proposal. 

Fink  and  Shriver  (1978)  made  a  point  similar  to  that  made  by  Kin¬ 
kade  and  Wheaton.  They  identified  four  training  stages:  (1)  acquisition 
of  enabling  skills  and  knowledge  (2)  acquisition  of  uncoordinated  skills 
and  unapplied  knowledge  (3)  acquisition  of  coordinated  skills  and  abil¬ 
ity  to  apply  knowledge  and  (4)  acquisition  of  job  proficiency.  They 
claimed  that  different  stages  require  different  levels  of  fidelity  with 
the  first  stage  requirirg  the  lowest  level. 

G.G.  Miller  (1974)  drew  the  following  conclusions  about  the  rela¬ 
tionship  between  fidelity  and  training. 


(1)  High  fidelity  is  never  associated  with  poor  training. 

(2)  Transfer  of  training  is  more  a  function  of  how  the 


simulator  is  used  rather  than  the  degree  of  fidelity. 

(3)  Procedural  task  training  does  not  require  high  fidelity. 
Conclusions  2  and  3  are  shared  by  many  other  researchers  (Duncan  and 
Shepherd  1975,  Johnson  1981,  Johnson  and  Rouse  1982),  while  Conclusion 
1,  as  pointed  out  before,  is  doubtful. 

No  consensus  has  been  reached  on  the  relationships  between  fidel¬ 
ity  and  other  factors  such  as  cost,  training,  and  stage  of  learning. 
The  research  in  this  area  is  not  very  conclusive.  The  difficulty  of 
measuring  fidelity  is  part  of  the  reason  for  the  slow  progress.  The 
next  section  discusses  the  problems  and  the  alternatives  for  the  meas¬ 
urement  of  fidelity. 

Measurement  of  Fidelity 

The  measurement  of  fidelity  is  an  important  step  if  one  wishes  to 
determine  empirically  the  relation  between  level  of  fidelity  and  train¬ 
ing  effectiveness  as  well  as  the  necessary  fidelity  level  of  a  simulator 
for  training  for  a  given  task.  Specific  transfer  of  training  studies 
are  possible  only  after  both  the  simulator  and  the  actual  equipment  have 
been  built.  Nevertheless,  there  is  a  need  to  be  able  to  predict  the 
effectiveness  of  the  training  device  prior  to  construction.  Considering 
the  tremendous  cost  and  man-hours  involved  in  developing  simulators  of 
any  fidelity  level,  one  cannot  be  satisfied  with  a  post  hoc  measure.  A 
measure  of  fidelity  that  correlates  with  the  measure  of  transfer  of 
training  is  a  useful  system  design  guide.  Therefore,  the  purpose  of 
measuring  fidelity  is  the  hope  that  a  predictive  index  can  be  devised 
for  anticipating  the  effectiveness  of  a  training  simulator.  A  reliable, 


predictive  index  of  the  effectiveness  of  a  simulator  will  be  very  useful 
both  for  trainers  and  design  engineers.  Other  things  being  equal,  such 
as  user  acceptance  and  required  levels  of  funding,  they  can  then  choose 
only  those  features  that  possess  high  transfer  value  and  still  meet  the 
training  objective.  However,  in  practice  this  is  very  hard  to  achieve 
due  to  the  difficulty  of  measuring  simulator  fidelity.  One  of  the  dif¬ 
ficulties  is  the  lack  of  generality  of  such  a  measure.  Govindaraj 
(1983)  pointed  out  that  the  environment  and  the  purpose  for  which  the 
simulator  is  to  be  used  have  a  strong  influence  on  fidelity.  Also, 
fidelity  appears  to  be  very  context-specific.  Therefore,  it  may  be  dif¬ 
ficult  to  derive  context-free  measures  of  fidelity. 

Wheaton  et  al.  (1976)  assessed  simulator  fidelity  on  two  dimen¬ 
sions:  physical  fidelity  and  functional  fidelity.  They  discussed  the 
metrics  oc  fidelity  in  the  context  of  constructing  a  model  to  predict 
training  device  effectiveness.  In  their  approach,  a  thorough  task 
analysis  of  the  target  system  and  the  simulator  was  conducted.  Subtasks 
j£  the  targec  systam  md  the  simulator  were  then  clearly  identified. 
The  ohysical  fidelity,  for  each  subtask,  between  the  real  system  md  the 
simulator  was  evaluated  by  rating  with  a  scale  that  ranged  from  "no 
resemblance",  "dissimilar",  "similar"  to  "identical".  The  functional 
fidelity  was  evaluated  by  recording  Che  operator's  behavior  in  terms  of 
the  information  flow  from  each  display  to  the  operator,  and  from  the 
operator  to  each  control.  For  each  subtask,  the  type,  amount,  and 
direction  of  information  was  assessed  using  information-theoretic 
methods.  Then  a  four-point  scale  was  applied  by  comparing  the  informa¬ 
tion  metrics  between  the  real  system  and  the  simulator  on  each  subtask. 
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The  underlying  assumption  was  that  the  higher  the  rating  on  the 
assessment  factors,  the  higher  the  transfer  that  would  take  place  and 
the  more  effective  the  simulator.  However,  as  pointed  out  by  Adams 
(1979),  rating  is  very  subjective  and  its  reliability  is  questionable. 
Further  refinement  of  this  assessment  process  was  reported  by  Narva 
(1977),  in  which  the  physical  fidelity  and  the  functional  fidelity  were 
measured  by  rating  with  emphasis  on  behavioral  categories  instead  of  the 
original  subtasks.  Some  of  the  behavioral  categories  used  include  rule 
learning  and  use,  detection,  symbol  identification,  decision  making, 


Caro  (1970)  advocated  a  procedure  called  Equipment  Device  Task 


Commonality  Analysis  in  which  the  measurement  of  fidelity  was  conducted 
by  assessing  the  similarity  of  S-R  relationships  in  the  real  system  and 
the  simulator.  Positive  transfer  was  assumed  to  occur  when  both  stimuli 
and  responses  were  similar.  Negative  transfer  was  predicted  when  the 
stimuli  were  similar  but  the  responses  were  different.  This  is  similar 
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Components  of  Fidelity 

As  pointed  out  in  the  previous  discussion,  "fidelity"  is  a 
mutli-dimensional  concept.  An  operational,  comprehensive  definition  may 
be  difficult  to  obtain.  However,  the  building  blocks  of  fidelity  have 
been  widely  noted  and  studied  for  a  long  time.  These  are  the  design 
features  of  a  simulator.  Some  of  them  are  discussed  below.  This  list 
is  definitely  not  exhaustive. 

Stress 


There  is  no  doubt  that  stress  is  experienced  by  most  operators  of 
any  real  system.  However,  as  Duncan  and  Shepherd  (1975)  pointed  out,  it 
is  not  clear  how  or  if  stress  can  be  simulated  during  training.  There 
are  at  least  three  types  of  stress.  First,  there  is  the  feeling  of 
danger.  Creating  this  type  of  stress  on  a  simulator  during  training  is 
very  difficult.  Second,  there  is  the  threat  of  hazard  or  sanction. 
This  form  of  stress  can  only  be  simulated  by  manipulating  reward  as  a 
consequence  of  performance.  Third,  there  is  time  stress.  This  can 
easily  be  introduced  into  the  training  task,  but  may  alter  the  trainee's 
perception  of  the  task.  Not  much  is  known  about  how  to  incorporate 
stress  into  simulator  training  or  if  its  presence  contributes  to  ade¬ 
quate  training.  While  such  an  assessment  is  quite  subjective,  anyone 
who  has  spent  a  good  deal  of  time  in  a  flight  simulator,  or  nuclear 
powerplant  simulator  will  attest  to  the  high  stress  feeling. 


Environment 


Noise  is  distracting  especially  in  complex  tasks  that  require 
close  attention  and  concentration  (Finkelman  1975).  Improper  lighting 
(Tinker  1943),  temperature  (Pepler  1972),  etc.  degrade  human  perfor¬ 
mance.  However,  how  much  these  affect  the  fidelity  level  or  how  much 
they  contribute  to  the  training  effects  is  a  matter  difficult  to  esti¬ 
mate.  While  noise,  inappropriate  lighting,  and  temperature  may  degrade 
general  performance,  systematic  noise  or  unusual  heat  or  temperature  are 
repeatedly  reported  to  be  of  great  help  for  failure  detection  and  diag¬ 
nosis.  Many  trainees,  designers  and  experienced  operators  admit  the 
possibility  of  using  unusual  environmental  changes  as  a  clue  to  detect 
or  diagnose  the  failure.  Vibration  has  been  given  the  same  appraisal 
(Longman,  Phelan  and  Hansford  1981,  McCallum  and  Rawson  1981,  Jaspers 
and  Hanley  1980,  Semple  et  al.  1981,  Martin  and  Wagg  1978). 

Layout 


Panel  layout,  display  size  and  even  the  coloring  of  instruments 
are  considered  to  be  important  factors  that  affect  the  feeling  of  real¬ 
ism.  More  importantly  is  the  relative  distance  and  the  relative  posi¬ 
tion  between  gauges,  annunciators  and  status  indicators  (Fowler  et  al. 
1968).  Duncan  and  Shepherd  (1975)  argued  that  the  trainees  may  develop 
strategies  that  heavily  depend  on  patterns  of  the  presented  stimuli. 
The  size  of  the  display  may  influence  the  amount  of  '  it  rmation  the 
trainee  can  process  at  any  one  time.  The  relative  distance  between 
gauges  and  the  relative  position  of  stimuli  may  affect  the  pattern 
recognition  process.  However,  Duncan  and  Shepherd  pointed  out  that  the 


influence  of  such  factors  is  unkown. 

Wholeness 

A  full-scale  simulator  provides  all  aspects  of  system  training, 
while  a  part-task  simulator  presents  only  selected  parts  of  the  full 
system  to  the  trainees.  The  benefit  of  a  part-task  trainer  is  that  some 
particularly  important  subsystem  such  as  the  turbine  or  the  boiler  may 
be  represented  with  greater  physical  fidelity  and  provided  for  training 
before  coping  with  the  entire  system.  However,  the  functional  fidelity 
may  be  affected  due  to  the  isolation  of  a  particular  subsystem.  Curry 
(1981)  observed  that  detection,  diagnosis  and  remedial  action  are  gen¬ 
erally  assumed  to  be  three  separate  tasks.  Therefore,  training  on  each 
one  can  be  accomplished  independently  without  too  much  trouble.  Rouse 
(1981)  found  that  a  particular  logical  judgement  process  is  especially 
important  for  effective  fault  diagnosis.  Abstracting  this  logical  pro¬ 
cess,  he  developed  a  context-free  task,  TASK,  which  is,  in  some  sense,  a 
decomposition  of  the  fauit  diagnosis  behavior.  He  demonstrated  positive 
cransrer  or  training  from  TASK  to  a  real  system.  Hasmussen  (.1980)  pro¬ 
posed  a  criterion  f or  cue  decomposition  of  a  complex  function.  He 
obsered  that: 

"...break-down  of  complex  functions  is  only  acceptable  if  the 
performance  is  paced  by  tne  system,  i.e.,  cues  from  the  system 
serve  to  initiate  elementary,  skilled  sub-routines  individu¬ 
ally  and  to  control  their  sequence.  This  is  the  case  in  many 
manual  Casks,  e.g.,  mechanical  assembly,  but  can  probably  also 
be  arranged  in  more  complex  mental  tasks  by  properly  designed 
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interface  systems."  (p.  92) 

The  influence  of  the  part-task  trainer  on  complex  mental  tasks,  such  as 
fault  diagnosis  and  problem  solving,  is  not  yet  clearly  understood. 
However,  the  unverified  conjecture  is  that  wholeness  is  not  a  crucial 
fidelity  factor. 


Dynamics 


Most  real  systems  are  dynamic,  as  are  most  simulators.  However, 
static  simulators  have  been  used  increasingly  in  the  past  few  years 
(Duncan  and  Shepherd  1975,  Shepherd  et  al.  1977,  Hunt  and  Rouse  1981, 
Johnson  and  Rouse  1982).  Static  simulators  only  allow  the  operators  to 
check  the  system  status,  while  dynamic  simulators  accept  control  com¬ 
mands  and  execute  them.  There  is  no  doubt  that  dynamic  simulators 
describe  the  object  task  better  than  static  simulators  do,  but  how  much 
better  is  a  question  unanswered.  Forbus  and  Stevens  (1981)  indicated 
that  there  is  a  growing  amount  of  evidence  that  human  understanding  of 
physical  systems  is  based  on  qualitative  models  of  those  systems.  This 
evidence  comes  from  psychological  studies  'Larkin  et  si.  .980)  ana  as 
supported  by  success  in  artificial  intelligence  in  ictuaily  constructing 
systems  Chat  reason  about  physical  situations  using  qualitative  models 
(deKleer  1979,  Forbus  1980).  Govindaraj  (1983)  proposed  a  qualitative 
approach  to  modeling  i  complex  dynamic  system.  This  approach  may  pro¬ 
vide  a  way  to  associate  the  level  of  dynamic  fidelity  with  an  explicit 
training  effect.  However,  there  is  no  empirical  data  to  support  the 
transfer  effect  of  the  qualitative  dynamic  simulator. 
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Abstraction 


A  physical  system  can  be  represented  mentally  in  different  forms 
(Rasmussen  1979,  Rasmussen  1985).  Simulators  may  be  constructed  to 
represent  the  physical  system  at  different  levels  of  abstraction.  On 
the  bottom  of  the  hierarchy  is  the  realization  of  the  physical  com¬ 
ponents  in  detail,  analogous  to  a  system  mock-up.  The  higher  the  model 
stands  in  the  hierarchy  by  aggregating  elements  into  larger  units  or  by 
abstracting  through  functional  properties,  the  less  the  physical  fidel¬ 
ity  becomes.  A  system  block  diagram  is  an  example  of  a  more  abstract 
simulator.  Each  level  of  abstraction  possesses  its  own  set  of  symbols 
and  syntactic  rules.  Abstract  simulators  may  be  more  effective  in 
training  for  fault  diagnosis  due  to  the  absence  of  irrelevant  cues. 
Rasmussen  argued  that  shifting  between  levels  of  abstraction  for  suit¬ 
able  strategy  may  be  helpful  for  problem  solving.  This  implies  that 
training  under  lower  physical  fidelity  and  higher  abstraction  level  may 
transfer  well  to  higher  physical  fidelity  and  lower  abstraction  situa¬ 
tion.  The  fact  that  diagnosis  can  be  viewed  as  a  top-down  process  may 
explain  why  lower  physical  fidelity  and  higher  abstraction  level  simula¬ 
tors  could  perform  better  in  this  type  of  training.  Therefore,  func¬ 
tionally  speaking,  it  is  hard  to  decide  which  one  has  higher  fidelity. 

The  value  of  simulators  of  different  abstraction  levels  may  be 
different  for  different  levels  of  trainees.  Kriessman  (1981)  speculated 
that  simulators  of  different  fidelity  level  may  achieve  the  training 
effect  differently.  A  high  fidelity  simulator  is  good  for  more  experi¬ 
enced  trainees,  while  a  low  fidelity  simulator  is  better  for  less 


experienced  ones.  However,  it  is  still  an  open  question  as  to  whether 
the  use  of  simulators  of  different  abstraction  levels  may  provide  the 
operator  with  different  skills  or  the  same  type  of  skills  but  in  a 
degraded  mode. 

State  Variables 

Most  of  the  state  variables  in  a  real  system  are  presented  in  a 
continuous  manner  via  gauges  and  meters,  while  for  simplification,  some 
simulators  may  represent  the  state  variables  in  discrete  language  such 
as  high/medium/ low  or  on/off.  Internally,  the  human  seems  to  process 
information  in  a  discrete  manner  (Kuipers  and  Kassirer  1984),  especially 
when  logical  reasoning  is  involved.  He  may  classify  information  into 
several  finite  sets.  Presenting  information  in  a  discrete  manner  may 
not  result  in  a  loss  of  information  as  long  as  the  classifying  scheme 
matches  the  human's  internal  model. 

The  increasing  use  of  CRTs  for  display  in  simulators  introduces 
difficulty  in  presentation  of  state  variables  because  of  size  con¬ 
straints.  The  most  common  strategy  is  to  use  serial  presentation 
instead  of  parallel  presentation  which  is  the  usual  way  information  is 
transferred  to  the  operator  in  a  real  system.  However,  considering  the 
human  as  a  limited  information  processor,  this  restriction  may  not  be  as 
serious  a  fidelity  problem  as  it  first  appears.  The  attention  span  for 
human  beings  is  well  known  to  be  narrow  and  varying  in  time.  The  state 
variables  in  a  real  system,  though  presented  simultaneously,  are  possi¬ 
bly  processed  in  a  serial  manner  -  perhaps  chunk  by  chunk.  However, 

how  serial  presentation  of  state  variables  affect  fidelity  may  depend  on 


the  type  of  simulators  used. 


Summary 

Several  attempts  have  been  made  to  define  "fidelity".  Hays 
(1981)  proposed  a  functional  fidelity  vs.  physical  fidelity  approach. 
Rouse  (1982)  suggested  a  similar  idea.  Govindaraj  (1983),  oriented 
toward  an  operational  definition,  decomposed  functional  fidelity  into 
structural  fidelity  and  dynamic  fidelity.  Lack  of  empirical  studies  of 
fidelity  issues  makes  it  difficult  to  develop  a  useful  definition  of 
fidelity.  A  generally  accepted  assertion  is  that  higher  fidelity  does 
not  guarantee  better  transfer.  Kinkade  and  Wheaton  (1971)  conjectured 
that  the  fidelity  requirement  varies  with  the  stages  of  learning.  Gen¬ 
erally,  it  is  proposed  that  procedural  tasks  do  not  require  as  high  a 
fidelity  as  visual-motor  skills  do. 

Wheaton  (1976)  and  Narva  (1977)  developed  a  predictive  index  for 
transfer  effect  based  on  the  measurement  of  fidelity.  Task  analysis  of 
both  the  real  system  ind  the  simulator  is  the  foundation  for  measure¬ 
ment.  Rating,  so  far,  is  *mp loved  in  almost  every  fidelity  metric.  A 

more  objective  metric  based  on  system  characteristics  is  an  important 
future  research  topic.  Several  factors  that  affect  fidelity  were  also 
discussed.  A  brief  summary  is  given  below. 

(1)  It  is  very  difficult  to  include  stress  in  the  simulator. 

(2)  Environmental  factors  such  as  noise,  lighting,  temperature, 
motion  and  vibration  are  annoying  but  may  be  treated  as  diag¬ 
nostic  aids.  Inclusion  of  these  variables  does  increase 


fidelity,  but  the  cost-effectiveness  of  including  them  in  a 
simulator  has  long  been  challenged. 

(3)  Layout  may  affect  the  strategy  used  by  trainees. 

(4)  The  important  issue  in  the  use  of  part-task  simulators  is  the 
decomposibility  of  the  casks. 

(51  Dynamic  features  may  not  be  crucial  in  training  for  fault 
diagnosis.  Several  studies  indicated  that  the  human  reasons 
in  a  qualitative  rather  than  quantitative  way. 

(6)  It  may  be  beneficial  to  vary  the  level  of  abstraction  of  the 
simulator  depending  upon  the  level  of  skill  of  the  trainee. 

(7)  In  a  real  system,  the  state  variables  are  presented  simul¬ 
taneously,  although  humans  may  not  be  able  to  process  all  of 
this  information  at  once.  As  a  limited  information  proces¬ 
sor,  human  operators  may  do  well  with  serial  presentation  of 
the  state  variables. 

In  general,  physical  fidelity  may  be  important  for  training  for 
normal  operations  while  non-physical  fidelity  plays  a  more  important 
role  in  training  for  troubleshooting,  especially  those  involving  cogni¬ 
tive  skills  (Johnson  1981,  Johnson  and  Souse  1982).  Following  this 
argument,  we  use  a  low  fidelity  simulator  of  a  marine  powerplant  in  our 
research  to  study  fault  diagnosis  behavior.  This  low  fidelity  simulator 
is  in  a  family  of  simulators  with  increasing  fidelity  levels  (low, 
moderate  and  high),  fhe  low  fidelity  simulator  has  moderate  structural 
fidelity  and  low  fidelity  on  both  physical  and  dynamic  fidelity  accord¬ 
ing  to  the  definition  of  fidelity  used  by  Govindaraj  (1983).  In  the 
next  chapter  the  low  fidelity  simulator  is  described  in  detail, 


following  a  comparison  of  the  low,  moderate,  and  high  fidelity  Simula- 


CHAPTER  III 


THE  LOW  FIDELITY  SIMULATOR 

Simulators  can  be  designed  at  various  levels  of  complexity  and 
realism  relative  to  the  real  systems.  The  degree  of  realism  and  close¬ 
ness  with  which  a  simulator  resembles  the  actual  system  can  be  formal¬ 
ized  in  terms  of  the  concept  of  fidelity  of  simulation.  The  issue  of 
defining  simulator  fidelity  was  discussed  in  Chapter  II;  the  difficulty 
of  defining  lidelity  and  the  components  of  fidelity  were  considered  in 
detail.  A  working  definition  of  fidelity  was  proposed  and  used  to  clas¬ 
sify  simulators  (Govindaraj  1983). 

Fidelity  was  treated  as  a  three-dimensional  concept:  physical, 
structural  and  dynamic  fidelity.  Physical  fidelity  is  concerned  with 
the  variables  that  are  displayed  and  in  how  much  detail  they  are  given. 
This  includes  environmental  factors  such  as  noise,  vibration  and  thermal 
conditions.  Structural  fidelity  refers  to  the  nature  of  relationships 
between  various  subsystems  that  make  up  the  system,  including  feedback 
and  feedforward  connections,  and  hierarchical  relationships.  Similarly, 
the  evolution  of  the  system  states  over  time,  and  their  representation 
can  be  characterized  by  dynamic  fidelity. 

Comparisons  of  Three  Simulators 

To  understand  how  simulator  fidelity  affects  human  fault  diag¬ 
nosis  behavior,  experiments  are  needed  using  simulators  at  different 
fidelity  levels.  This  chapter  considers  oil-fired  marine  powerplant 
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simulators  at  high,  moderate  and  low  fidelity  levels  based  on  the  defin¬ 
ition  presented  above. 

A  high  fidelity  simulator  (HFS)  is  available  at  Marine  Safety 
International  (MSI),  New  York.  Partial  view  of  the  simulator  is  shown 
in  Figure  3-1.  This  simulator  has  two  boilers  of  equal  size  with  two 
burners  per  boiler.  Two  feedpumps  supply  water  to  the  boilers.  There 
is  a  high  pressure  turbine  and  a  low  pressure  turbine.  The  turbines  are 
controlled  by  a  throttle  control  system  at  the  control  console.  Behind 
the  control  room  where  the  control  console  is  housed,  the  engine  room  is 
simulated  using  components  such  as  boilers,  feedpumps,  and  vacuum  pumps. 
The  environment  is  simulated  with  appropriate  noises,  humidity,  and  tem¬ 
perature.  The  entire  simulator  is  run  by  a  PDP  11/70  minicomputer. 

Gauges,  alarms  and  status  indicators  are  displayed  on  the  control 
console.  There  are  gauges  for  temperatures,  pressures  and  levels  of  the 
boilers  and  other  subsystems  such  as  the  condenser  system.  Status  indi¬ 
cators  can  be  grouped  according  to  their  functions.  There  3re  12 
groups:  throttle  :oncroi,  main  turbine  oearing  temperatures,  3T3D  toiler 
burner  management,  ?QRT  boiler  burner  management,  salinity,  33TG  bearing 
temperatures,  bleed  status,  lube  oil,  condenser,  turbine  viax  panel, 
fuel  oil  and  feedpump  systems.  Six  panels  of  alarms  are  shown  in  the 
middle  section  of  the  console.  Each  panel  contains  a  number  of  indica¬ 
tors  which  are  lighted  as  necessary  to  display  the  states  of  components 
and/or  subsystems.  They  are  alarms  for  auxiliary,  STBD  boiler,  PORT 
boiler,  feedwater,  throttle  control  and  vacuum  systems.  There  are 
roughly  100  gauges,  200  alarms  and  200  status  indicators  in  the  entire 


system.  This  simulator  has  high  fidelity  along  all  three  dimensions. 
Besides  the  apparent  physical,  dynamic  and  structural  fidelity,  schemat¬ 
ics  that  describe  the  structure  of  the  system  are  also  available. 

A  moderate  fidelity  simulator  is  being  developed  at  Georgia  Tech 
(Govindaraj  1984)  in  which  the  dynamics  of  major  subsystems  are 
represented  qualitatively.  Even  though  the  physical  fidelity  of  this 
simulator  may  be  low,  fairly  high  structural  fidelity  and  moderate 
dynamic  fidelity  are  achieved.  A  low  fidelity  simulator  called  FAIL 
(FAULT-based  Aid  for  Instruction  and  Learning)  (the  term  FAULT  is 
described  in  the  next  section)  has  been  developed  that  runs  on  an  APPLE 
II  computer.  System  information  is  displayed  to  the  operator  upon 
request.  Schematic  diagrams  of  important  subsystems  are  provided  on 
paper.  Even  though  the  physical  and  dynamic  fidelity  are  very  low,  the 
structural  fidelity  is  close  to  the  high  and  moderate  fidelity  simula¬ 
tors.  Comparison  of  these  three  simulators  in  terms  of  the  three  dimen¬ 
sions  of  fidelity  is  summarized  in  Table  3-1. 

Aithougn  HFS  has  nigh  fidelity  along  all  three  dimensions,  :ne 
hign  cost  of  construction  and  operation  and  tne  difficulties  m  aata 
collection  have  prevented  it  from  being  used  as  a  vehicle  to  study  fault 
diagnosis  behavior.  As  discussed  in  Chapter  II,  high  non-physical 
fidelity  may  be  sufficient  for  training  of  cognitive  or  procedure  type 
of  tasks.  In  this  thesis  the  fault  diagnosis  behavior  on  the  low  fidel¬ 
ity  simulator,  FAIL,  which  has  relatively  high  structural  fidelity  is 
described.  The  details  of  FAIL  experiments  in  which  experienced  marine 
engineers  participated  as  subject,  are  described  next. 
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Table  3-1:  Comparison  of  three  simulators 


Physical 

Fidelity 

Structural 

Fidelity 

Dynamic 

Fidelity 

Low  Fidelity 
Simulator 

Low 

Moderate 

Low 

Moderate  Fidelity 
Simulator 

Low 

Moderate 

Moderate 

High  Fidelity 
Simulator 

High 

High 

High 

The  Low  Fidelity  Simulator  —  FAIL 

This  low  fidelity  simulator  runs  on  an  APPLE  II  Plus  microcom¬ 
puter,  with  64K  bytes  of  memory,  an  80-column  card  and  a  real  time 
clock.  It  is  used  to  accept  inputs  from  the  subject  and  to  display 
powerplant  status  and  other  relevant  information  (Figure  3-2). 

In  addition,  a  number  of  hardcopy  diagrams  provide  system 
schematics  (e.g.,  Figure  3-3)  (see  Appendix  D)  at  various  levels  of 
detail.  Also  provided  is  a  list  of  commands  (see  Appendix  F)  used 
interactively  to  check  for  the  status  of  gauges,  alarms  and  indicators. 


i 


i 
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The  simulator  evolved  from  FAULT  (Framework  for  Aiding  the  Under¬ 
standing  of  Logical  Troubleshooting)  (Hunt  and  Souse  1981).  FAULT  is  an 
interactive  context-specific  troubleshooting  simulator.  In  FAULT,  the 


Operation 
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Quit 
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Figure  3-2:  Main  Screen  Layout  of  FAIL. 

I:  Actions  available. 

II:  System  message  display  window. 

Ill:  Action  designation  window. 

IV:  Record  of  past  actions  and  costs. 

7:  Initial  condition,  symptoms,  indicator 
status  and  recalled  information. 


user  is  given  a  hardcopy  form  of  a  network  representation  of  the  system 
of  interest,  and  a  statement  of  the  symptoms  which  reflect  the  fact  that 
some  component  or  part  in  the  system  is  malfunctioning.  By  interacting 
with  a  CRT  terminal,  the  problem-solver  must  gather  information  in  order 
to  identify  which  of  the  components  or  parts  has  failed.  Problem  solv¬ 
ing  performance  is  judged  by  the  total  cost  to  identify  the  failed  com¬ 
ponent,  calculated  from  the  costs  of  all  actions  and  parts  replaced. 
The  cost  for  each  simulated  action  corresponds  to  the  actual  cost  (or 
time  equivalent)  of  performing  such  an  action  on  a  real  system. 


Interface 


Using  design  concepts  similar  to  those  used  in  FAULT,  FAIL 
presents  failures  to  the  subjects  via  a  CRT  terminal.  All  the  problems 
used  were  prepared  by  the  MSI  experts.  They  were  real  failures.  Each 
problem  presented  a  snapshot  of  a  failure.  At  the  start  of  a  trial, 
initial  conditions  and  the  symptoms  of  a  failure  are  shown  on  the 
display  (Figure  3-2).  For  example,  the  initial  condition  could  be 
"Steaming  at  sea",  and  the  symptoms  of  failure  could  be  "Vacuum  pump 
fail  alarm  sounds".  The  subjects  may  request  more  information  by  typing 


in  appropriate  commands. 
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A  subset  of  the  actual  alarms,  status  indicators  and  gauges  (384 
out  of  approximately  500)  is  used  in  this  simulator.  Gauges,  alarms  or 
status  indicators  for  subsystems  that  are  not  power-related,  such  as 
service  water  pressure,  were  not  included.  Displays  other  than  gauges 
and  meters  are  classified  into  12  groups  by  their  location  and  function. 
Each  group  is  identified  by  a  name.  Alarms  and  status  indicators  are 
given  in  groups  while  gauges  are  shown  individually  because  approxi¬ 
mately  the  same  effort  is  required  to  locate  the  former  in  a  group,  or 
the  latter  separately. 

All  actions  and  their  associated  costs  are  displayed  at  the  lower 
left  portion  of  the  screen.  Cost  for  each  action  reflects  the  time 
involved  to  obtain  the  relevant  information  on  the  real  system. 
Estimated  values  of  various  costs  were  provided  by  marine  engineering 
experts  at  MSI.  The  objective  is  to  minimize  the  cost,  the  actual  time 
spent  and  the  total  number  of  actions  for  diagnosing  the  failure.  At 
the  end  of  a  trial,  the  subject's  total  number  of  actions,  cost  and 
cotal  time  elapsed  are  displayed. 

The  subject  requests  anrormation  aDout  gauges,  alarms  ana  status 
indicators  by  typing  in  appropriate  commands.  Only  the  alarms  and  the 
status  indicators  that  are  activated  by  the  given  failure  are  shown  to 
the  subject  to  simulate  the  illuminated  tiles  in  the  real  system.  The 
subject  keeps  requesting  information  from  the  system  until  he  is  ready 
to  diagnose.  Then  he  identifies  the  failed  component  by  designating  the 
number  shown  beside  the  chosen  component  on  the  schematics.  Gauges  are 
also  numbered,  as  well  as  color  coded  on  the  schematics  to  avoid  confu- 
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sion  with  component  numbers. 


Operator  Inputs 

Each  of  the  seven  possible  actions  is  described  below. 

1.  GAUGES  (G)  : 

A  qualitative  description  such  as  LOW,  HIGH,  NORMAL  or  ABNORMAL  is 
shown  instead  of  the  actual  readings  of  the  gauges.  When  appropri¬ 
ate,  the  time  history  of  the  gauge  of  interest  can  also  be  provided. 
For  example,  a  message  like  "Pressure  drop  to  300  lb  in  10  min"  could 
give  the  subject  some  idea  about  changes  of  the  system  states  over 
time.  The  cost  associated  with  this  action  is  3  units. 

2.  ALARMS  (A)  : 

This  option  displays  the  alarm  status  of  a  particular  subsystem. 
Only  those  illuminated  tiles  in  the  real  system  are  shown  in  the 
right  half  of  the  display.  If  none  of  them  is  turned  on,  an  "ALL  ARE 
NORMAL"  message  will  appear.  When  more  than  22  alarms  are  on  at  one 
time,  the  last  line  displays:  "MORE  ON  NEXT  PAGE.  PRESS  "ENTER"  TO 
SEE  MORE".  The  subject  presses  either  the  "ENTER"  itey,  or  types  "Q" 
to  return  to  the  command  mode.  The  cost  associated  with  this  action 
is  2  units. 

3.  INDICATORS  (I)  : 

The  status  indicators  of  the  subsystem,  that  provided  information 
such  as  "on",  "off",  "standby",  or  "running",  are  shown  on  request. 
The  content  and  format  are  similar  to  "ALARMS".  The  cost  is  2  units. 
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4.  DIAGNOSIS  (D)  : 

The  users  can  check  their  conclusions  and/or  test  their  hypotheses 
using  this  option.  All  the  failures  can  be  traced  down  to  the  physi¬ 
cal  component  level  using  numbers  on  the  system  schematics.  The  cost 
is  50  units. 

5.  LOOK  (L)  : 

This  option  simulates  the  action  of  going  into  the  engine  room  to 
physically  inspect  a  component.  The  subjects  can  observe  the  status 
of  various  components  including  noises,  gauge  readings  and  vibration. 
No  complicated  procedures  are  involved  to  inspect  a  component  under 
this  option.  For  example,  to  respond  to  a  "LOOK"  at  the  feedwater 
regulator,  the  system  may  indicate  "open  10  %".  If  the  components 
the  subjects  choose  to  "LOOK"  at  cannot  be  seen  on  a  real  ship  such 
as  "boiler  tubes",  "sea  scoop"  etc,  the  system  responds  with  "not 
available".  The  cost  is  5  units. 

6.  QUIT  (Q)  : 

This  option  is  used  to  terminate  the  diagnostic  process  without  com¬ 
pletion.  No  cost  is  assigned  to  this  option. 

7.  RECALL  (R)  : 

Since  all  the  actions  and  their  responses  are  stored,  they  can  be 
reviewed  using  this  option.  No  extra  cost  is  incurred  for  doing 
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An  Example  Run 

A  typical  problem  session  with  FAIL  can  be  illustrated  using  Fig¬ 
ure  3-2.  Figure  3-2  shows  the  main  screen  for  Failure  13  in  which  the 
gland  seal  regulator  failed  (component  no.  168).  The  record  of  actions 
performed  by  a  subject  is  shown  in  region  IV.  At  the  beginning,  the 
initial  condition  and  symptoms  are  shown  in  region  V.  They  may  be 
removed  to  make  space  for  indicator  status  etc.  After  studying  the 
symptoms,  the  subject  decided  to  check  the  cooling  water  pressure.  He 
searched  the  command  list  and  found  that  Gauge  35  provided  its  value. 
Then  he  typed  G35  in  region  III.  The  system  responded  by  displaying 
"Main  cooling  water  pressure  normal"  in  region  II. 

He  then  checked  the  condenser  level  (G60) .  The  message  "Main 
condenser  level  normal"  was  displayed  in  region  II.  The  subject  typed 
in  "G31"  and  saw  "Gland  seal  steam  pressure  low"  in  region  II.  To  know 
more  about  the  condenser  system,  he  chose  "19"  (Indicator  status  of  con¬ 
denser  system).  FAIL  displayed  all  relevant  indicator  status  such  as 
"pump  running",  "valve  closed"  etc.,  in  region  V.  After  studying  the 
indicator  status  of  the  condenser  system,  he  might  hit  the  carriage 
return  key  to  move  the  cursor  back  to  region  III  for  further  actions. 

The  next  command  was  "L77"  which  revealed  in  region  II  the  status 
of  a  component,  vacuum  pump,  in  the  engine  room.  The  following  "R"  com¬ 
mand  displayed  all  the  previous  information  in  region  V  for  review.  Ht 
diagnosed  the  failed  component  correctly  via  the  last  command  "D168". 
FAIL  responded  to  the  correct  solution  by  showing  statistics  such  as 
total  time  spent,  total  number  of  actions  and  total  cost.  This 
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statistics  would  also  appear  if  "Q"  was  chosen.  Any  time  the  correct 
solution  or  the  "Q"  command  was  chosen,  a  problem  was  terminated.  FAIL 
would  then  go  to  the  next  problem. 

Summary 

FAIL  is  a  low  fidelity  simulator  of  a  marine  powerplant.  It  was 
used  in  experiments  as  an  integral  part  of  a  training  program  at  MSI. 
Subjects,  who  were  participants  in  a  training  program  at  MSI,  were 
presented  with  realistic  failures  on  FAIL.  They  were  asked  to  find  the 
cause  of  the  failure  while  keeping  the  total  time  spent  and  the  total 
number  of  actions  small.  Subjects  could  interrogate  FAIL  to  find  out 
the  system  status,  which  might  help  the  generation  and  evaluation  of 
diagnostic  hypotheses.  All  actions  and  their  associated  costs  were 
recorded  for  later  analysis.  A  sequence  of  experiments  were  conducted. 
The  next  chapter  describes  the  design  of  the  experiments,  including  a 
description  of  the  subjects,  the  experimental  procedures  and  the  data 
toilection  details. 
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CHAPTER  IV 

DESIGN  OF  THE  EXPERIMENTS 

A  low  fidelity  simulator,  FAIL,  used  in  experiments  to  study 
fault  diagnosis  behavior  was  described  in  the  previous  chapter.  FAIL 
incorporates  realistic  failures  and  system  schematics  to  simulate 
failure  situations  in  a  marine  powerplant.  An  experimental  session  con¬ 
sisted  of  a  number  of  problems.  Each  problem  provided  a  set  of  symptoms 
caused  by  the  failure  of  a  single  component.  Subjects  were  asked  to 
find  the  faulty  component  that  caused  the  failure  symptoms  presented  to 
them  and  at  the  same  time  minimize  the  real  time  spent  and  actions.  The 
following  is  a  description  of  the  subjects,  the  experiments  and  the  data 
collection  procedures. 


Subiects 

Engineers  from  various  fleets  of  a  major  petroleum  processor  who 
attended  m  MSI  training  course  oarticipated  as  subjects.  Thev 
represented  American,  English,  and  Italian  fleets.  Ail  were  experienced 
marine  engineers.  Twenty-one  out  of  28  were  chief  engineers.  The  rest 
were  either  first,  second  or  third  engineers.  The  chief  engineers  had 
more  experience  than  the  first  engineers,  who  in  turn  had  more  experi¬ 
ence  than  the  second  engineers. 

The  training  course  is  designed  to  improve  their  problem  solving 
skills.  It  is  two  weeks  long  and  consists  of  three  parts:  HFS  training, 
laboratory  training  and  lectures.  The  lectures  were  given  during  the 
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second  week,  while  the  subjects  alternated  between  the  HFS  training  and 
laboratory  training  in  the  first  week.  During  the  laboratory  training, 
subjects  were  given  exercises  in  logical  troubleshooting  computer  tasks 
using  TASK  (Rouse  1981),  FAULT  (Hunt  and  Rouse  1981)  and  FAIL.  FAULT  is 
a  context-specific  troubleshooting  task  as  described  in  Chapter  III. 
TASK  involves  fault  diagnosis  of  graphically  displayed  network  of  nodes. 
Each  node  may  have  random  number  of  inputs  and  outputs.  If  a  node 
fails,  it  will  produce  values  of  0  on  all  the  outputs  that  emanate  from 
it.  Given  the  output  status  of  the  last  column  of  nodes,  trainees  are 
asked  to  find  the  faulty  node.  Contrasted  with  FAULT,  TASK  is  a 
context-free  troubleshooting  task. 

Subjects  varied  greatly  in  their  experience  with  and  exposure  to 
computers.  Most  had  never  seen  or  used  any  type  of  computers  before. 
They  were  given  TASK  and  FAULT  for  the  first  two  days  of  the  first  week 
(approximately  three  hours  a  day)  to  familiarize  them  with  the  use  of 
computer  keyboard,  monitor  and  disks,  as  well  as  simulated  diagnostic 
tasks.  On  the  third  day,  a  set  of  three  failures  were  used  in  a  demons¬ 
tration  to  familiarize  them  with  FAIL.  Since  they  had  experience  on 
TASK  and  FAULT,  this  brief  training  was  sufficient. 

The  Italian  subjects  had  some  language  difficulties.  They  were 
allowed  to  ask  questions  about  the  problems.  Help  was  sometimes  pro¬ 
vided  by  a  MSI  expert  who  speakes  fluent  Italian  when  they  had  language 


difficulties . 


Experimental  Procedures 

The  subjects  were  seated  before  APPLE  II  Plus  microcomputers  in  a 
room  that  accomodates  eight  people  comfortably.  This  room  is  used 
exclusively  for  problem  sessions  using  APPLE  II  computers.  They  were 
given  a  list  of  components,  a  set  of  schematics  of  the  system,  and  a 
command  list  (see  Appendix  F).  An  instruction  sheet  (see  Appendix  E) 
was  read  to  them  which  explained  the  experimental  procedures.  For  each 
trial  they  were  asked  to  find  the  solution  by  identifying  the  failed 
component  while  keeping  the  total  cost,  number  of  actions  and  time  to  a 
minimum. 

In  order  to  study  fault  diagnosis  behavior  in  a  realistic  diag¬ 
nosis  environment,  experiments  were  designed  such  that  the  subjects 
would  not  be  unnecessarily  constrained  by  the  study.  The  following  pro¬ 
cedures  were  adopted  to  achieve  this  goal. 

1.  The  experiments  were  integrated  into  a  training  program  designed  to 
improve  subjects'  fault  diagnosis  skills.  The  program  was  two  weeks 
long  and  consisted  of  three  parts:  HFS  training,  lectures,  and 
laboratory  training.  FAIL  was  run  as  part  of  the  laboratory  train¬ 
ing. 


2.  Instead  of  calling  the  experiments  "experiments",  we  referred  to  them 
as  "exercises".  This  produced  a  positive  attitude  towards  the  exper¬ 
iments.  However,  some  subjects  got  too  serious  and  asked  if  the 
results  of  the  "exercises"  would  be  sent  back  to  the  company  for 
evaluation.  To  reduce  their  anxiety,  they  were  told  that  these 


"exercises"  were  designed  solely  for  this  training  program,  and  noth¬ 
ing  would  be  sent  back  to  the  company. 

3.  Subjects  were  asked  not  to  discuss  the  "excercises"  with  each  other. 
They  were  told  that  all  their  questions  would  be  answered  by  the 
experts  at  MSI  and  discussed  openly  in  the  classroom  at  the  end  of 
the  training  program. 

Experiments  and  Data  Collection 

A  pilot  experiment  and  five  experimental  sessions  were  conducted. 
From  the  pilot  experiment,  it  was  found  that  the  command  "A"  for  check¬ 
ing  the  alarm  status  was  not  useful  since  most  of  the  salient  symptoms 
had  already  been  presented  on  the  initial  screen.  On  the  other  hand,  we 
found  that  an  option  to  simulate  the  behavior  of  checking  the  physical 
status  of  components  in  the  engine  room  could  increase  the  realism  of 
FAIL.  A  new  command  "L"  (for  "Look")  was  then  designed  and  incorporated 
into  FAIL  for  this  purpose.  Therefore,  from  Session  1  on,  command  ”L" 
took  the  place  of  command  ’A".  It  was  also  found  from  one  pilot  experi¬ 
ment  that  some  of  the  failures  (see  Appendix  A)  were  not  realistic.  A 
different  set  of  failure  (see  Appendix  B)  was  compiled  by  experts  at  MSI 
from  reports  of  real  failures.  Throughout  the  experiments,  the  actions 
taken  by  subjects  were  recorded  automatically  by  FAIL. 

No  models  or  frameworks  of  fault  diagnosis  were  assumed  before 
the  design  of  FAIL  and  the  design  of  the  experiments.  A  preliminary 
description  of  fault  diagnosis  was  to  be  based  on  observations  of  sub¬ 
ject  performance.  A  framework  of  fault  diagnosis  performance  was 
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developed  using  the  data  collected  and  observations  made  from  experi¬ 
ments.  The  framework  was  then  used  to  identify  and  verify  factors 
affecting  fault  diagnosis  performance  from  Sessions  1  through  5.  This 
analysis  quantified  the  quality  of  actions  and  then  averaged  these  data 
across  subjects. 

Individual  variation  is  such  a  striking  feature  of  human  fault 
diagnosis  performance  that  any  attempt  to  average  data  across  a  popula¬ 
tion  is  certain  to  mask  features  of  the  experts'  behavior.  It  is  essen¬ 
tial  to  get  enough  data  about  each  individual  subject  to  identify  what 
information  he  has  and  how  he  processes  it.  As  Newell  and  Simon  (1972) 
point  out,  only  the  verbal  behavior  can  dig  deeply  into  the  complexity 
of  cognitive  behavior.  Therefore,  for  Sessions  4  and  5,  in  addition  to 
the  regular  FAIL  diagnostic  task,  some  subjects  were  asked  to  "think 
aloud"  and  the  protocols  were  recorded.  Kuipers  and  Kassirer  (1984) 
argued  that  this  type  of  think-aloud  experiment  is  particularly  sensi- 
j  tive  to  the  natural  control  structure  of  the  subjects'  problem  solving 

method.  The  information  reported  vas  actually  in  the  subjects'  focus  of 
attention  at  the  time  of  diagnosis.  A  model  of  fault  diagnosis  is  pro¬ 
posed  based  on  these  protocols  and  data  from  experiments.  Details  of 
this  model  are  discussed  in  Chapter  VI. 

Descriptions  of  the  experiments  are  given  below. 

Pilot  Experiment 

1 

i 

i  Five  subjects  participated  in  this  experiment;  They  are  referred 


to  as  SI,  S2,  S3,  S4  and  S5  later  in  this  thesis.  The  failures  used  on 


HFS  were  different  from  those  on  FAIL.  Each  subject  was  given  18  prob¬ 
lems  (see  Appendix  A)  containing  one  failure  each.  The  first  three 
problems  were  used  for  training.  Problems  were  presented  to  each  sub¬ 
ject  in  the  same  sequence.  Subjects  were  allowed  to  spend  as  much  time 
as  they  wanted  on  each  problem.  Trials  were  sometimes  interrupted 
because  of  the  HFS  schedule.  The  sequence  was  continued  later  at  the 
point  where  it  was  interrupted.  Subjects  were  allowed  to  ask  questions 
since  some  of  them  had  language  difficulties.  The  "L"  command  was  not 
available  for  this  experiment. 

Session  1_ 

Four  subjects  participated  in  this  session  (S6,  S7 ,  S8,  S9). 
This  session  used  a  set  of  29  failures  (see  Appendix  B),  which  were  dif¬ 
ferent  from  those  used  in  the  pilot  experiment.  By  repeating  five  of 
the  failures,  a  total  of  34  problems  were  presented.  The  sequence  of 
problems  varied  for  each  subject.  The  first  three  problems  were  used  as 
training  trials.  The  "L"  command  was  available  in  this  session. 

Session  2. 

Five  subjects  participated  in  this  session  (Sll,  S12,  S13,  S14, 
S15).  They  were  less  experienced  than  trainees  in  the  pilot  experiment 
and  Session  1.  Most  of  them  were  either  second  or  third  engineers  while 
the  trainees  in  the  pilot  experiment  and  Session  1  were  chief  engineers. 
The  set  of  failures  used  in  Session  1  was  used  in  this  session  except 
that  only  Failure  1  and  Failure  3  were  used  for  training  and  Failure  9 
was  modified  slightly.  The  experimental  environment  was  very  similar  to 
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that  of  Session  1.  However,  the  sequence  of  problems  given  to  each  sub¬ 
ject  was  the  same  for  all  subjects. 


Session  3 


Seven  subjects  participated  in  this  session  C S 1 6  -  S22).  All 
were  chief  engineers.  The  set  of  failures  used  in  Session  1  was  used  in 
this  session. 


Session  4:  protocr Is  taken 


Six  subjects  participated  in  this  session  (S25  -  S30) .  All  were 
chief  engineers.  The  set  of  failures  used  in  Session  1  was  used  in  this 
session. 


Session  5.:  protocols  taken 


Six  subjects  participated  in  this  session  (S31  -  S36).  They  were 
either  chief,  first  or  second  engineers.  The  set  of  failures  used  in 
Session  1  was  used.  However,  Failures  27  and  29  were  used  for  training 
instead  of  the  first  three.  Also,  the  sequence  of  failures  used  in  Ses¬ 
sions  2,  3,  and  4  was  reversed. 


Summary, 


Twenty-eight  expert  engineers  participated  in  five  experimental 
sessions  as  subjects.  Several  modifications  were  made  based  on  the 
pilot  experiment,  including  commands  available  and  set  of  failures  used. 
Precautions  were  taken  to  ensure  that  subjects  had  positive  attitudes 
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towards  the  experiments.  Actions  were  recorded  automatically  for 
analysis  which  quantified  the  quality  of  actions  and  avenged  across 
subjects.  To  investigate  diagnosis  behavior  more  deeply,  some  subjects 
in  Sessions  4  and  5  were  asked  to  think  aloud  while  solving  problems. 
The  experimental  results  were  analyzed  to  probe  problem  solving 
behavior.  The  next  chapter  describes  a  framework  for  fault  diagnosis. 
This  framework  is  used  to  analyze  the  data  from  Sessions  1  through  5. 
The  implications  of  these  experimental  results  are  also  discussed. 
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CHAPTER  V 

ANALYSIS  OF  EXPERIMENTAL  RESULTS 

In  the  preceding  chapters  we  described  a  low  fidelity  simulator 
of  a  marine  powerplant,  FAIL,  and  how  it  was  incorporated  in  MSI  train¬ 
ing  program  to  run  experiments.  Data  collected  from  the  last  five 
experimental  sessions  were  analyzed  with  help  from  subject-maatter 
experts  at  MSI.  This  chapter  describes  the  results  of  the  data 
analysis.  First  a  framework  for  fault  diagnosis  is  described.  Using 


this  framework,  two  factors  affecting  the  fault  diagnosis  performance 
are  identified  and  tested  for  statistical  significance.  The  implica¬ 
tions  of  these  factors  on  training  are  discussed.  Then,  some  observa¬ 
tions  on  subjects'  responses  are  described. 

A  Framework  for  Fault  Diagnosis 

Fault  diagnosis  can  be  viewed  as  a  backward  reasoning  process 
that  goes  from  symptoms  to  causes.  However,  the  knowledge  accumulated 
from  years  of  education  is  arranged  in  a  forward  reasoning  format  chat 
goes  from  components  (hence  causes)  to  symptoms.  Subjects  are  well 
versed  in  reasoning  from  a  failed  component  to  its  symptoms  based  on  the 
system  topology  and  dynamics.  Given  sufficient  information  about  the 
input  conditions  of  a  component,  expected  output  states  or  influence  on 
the  downstream  components  can  be  derived  if  appropriate  dynamics  is  used 
(see  Figure  5-1).  Knowledge  of  system  dynamics  is  the  basis  of  this 
forward  reasoning  process.  Though  the  dynamics  may  be  complicated,  the 
reasoning  from  causes  to  symptoms  is  relatively  straightforward. 
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However,  backward  reasoning  from  symptoms  to  components  cannot  be  accom¬ 
plished  by  applying  the  system  dynamics  directly  because  rules  of  dynam¬ 
ics  do  not  work  in  reverse  time  order.  In  this  backward  reasoning  pro¬ 
cess,  the  counterpart  of  system  dynamics,  and  hence  the  output  (the 
cause),  is  unknown  (see  Figure  5-2). 

When  a  failure  occurs,  the  faulty  component  causes  deviations 
from  normal  system  states.  Among  these  state  deviations  some  are  obvi- 
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ous  and  manifest  themselves  in  the  form  of  alarms,  or  other  irresistable 
clues,  while  some  are  non-obvious  and  must  be  investigated.  We  call  the 
first  kind  the  "obvious"  symptoms  and  the  second  kind  the  "non-obvious" 
symptoms.  In  most  cases,  the  obvious  symptoms  are  just  alarms 
representing  the  non-obvious  symptoms  that  exceed  acceptable  levels. 


Obvious  Symptoms .  Non-obvious  Symptoms .  and  Causes 


Obvious  symptoms  are  those  presented  usually  via  audio  alarms  or 
flashing  lights  to  attract  the  operator's  attention.  Compensatory 
actions  appropriate  to  these  symptoms  are  often  required  to  maintain  the 
powerplant  in  an  operating  mode.  However,  these  obvious  symptoms  alone 
may  not  provide  enough  information  to  diagnose  the  failures.  The  sub¬ 
jects  need  to  gather  more  information  in  order  to  form  a  hypothesis 
about  the  faulty  component  and  evaluate  the  hypothesis.  To  gather  more 
information  the  subjects  must  interrogate  the  system  for  the  non-obvious 
symptoms . 


The  relationship  between  symptoms  and  causes  is  dynamic  since  the 
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symptoms  could  change  with  time.  However,  at  any  moment,  a  rather  sim¬ 
ple  relationship  between  symptoms  and  causes  seems  to  exist.  That  is,  a 
set  of  obvious  symptoms  may  be  common  to  several  sets  of  non-obvious 
symptoms  each  resulting  from  several  different  causes  (Figure  5-3).  A 
set  of  causes  is  the  collection  of  components  which  are  suspected  to  be 
the  cause  of  observed  symptoms.  It  is  possible  that  several  different 
causes  may  lead  to  the  same  set  of  non-obvious  symptoms.  A  set  of 
causes  may  take  the  form  of  a  subsystem  that  is  characterized  by  some 
clearly  identifiable  function.  For  example,  it  may  be  the  "Condenser 
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Figure  5-4:  Stages  of  Diagnostic  Process 
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causes  forming  the  leaves.  The  diagnostic  process  is  to  find  a  path 
from  the  root  to  the  correct  leaf,  i.e.,  the  faulty  component.  This 
process  can  be  further  broken  down  into  two  stages.  In  the  first  stage, 
given  obvious  symptoms  the  subjects  gather  information  about  the  system 
in  order  to  find  a  set  of  possible  causes  that  explain  all  the  known 
symptoms.  These  symptoms  include  relevant  information  uncovered  during 
the  investigation.  The  available  information  might  result  in  the  elimi¬ 
nation  of  certain  candidates  from  the  suspected  set  of  feasible  causes. 
This  is  like  forming  branches  for  the  symptom-cause  tree.  After  a  set 
of  possible  causes  is  identified,  the  subjects  shift  into  the  second 
stage  in  which  they  try  to  identify  a  narrower  set  of  causes  and  search 
through  the  hierarchy  of  the  chosen  set  of  causes  to  the  faulty  com¬ 
ponent.  Results  from  this  stage  may  be  fed  back  into  the  hypothesis 
formation  stage  to  eliminate  infeasible  candidates.  In  other  words,  in 
the  first  stage,  hypotheses  about  faulty  subsystems  are  formed  while  in 
the  second  stage,  these  hypotheses  are  evaluated  and  tested.  Subjects 
alternate  between  these  two  stages  to  arrive  at  the  solution  (Figure  5- 
4)  . 

Feasible  Sets 

A  preliminary  step  before  any  hypothesis  can  be  formed  is  to 
search  through  the  knowledge  base  to  find  related  symptom-cause  pairs. 
This  is  done  to  build  new  branches  for  the  symptom-cause  tree.  The  col¬ 
lection  of  these  branches  forms  a  feasible  set  of  causes  for  the  given 
failure.  When  members  of  this  feasible  set  related  to  the  failure  are 
found,  new  hypotheses  are  formed.  The  feasible  set  is  modified  as  the 
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diagnostic  process  continues.  Older  branches  are  discarded  if  evidence 
gathered  does  not  warrant  further  consideration.  New  branches  are  added 
as  more  elaborate  reasoning  is  completed.  Although  each  hypothesis  may 
appear  reasonable,  not  every  hypothesis  is  guaranteed  to  be  correct. 
The  failed  hypothesis  may  provide  insights  into  localizing  the  failure, 
and  for  forming  new  hypotheses. 


Factors  Affecting  Fault  Diagnosis  Performance 


Two  factors  affect  the  efficiency  of  a  diagnostic  process.  They 
are  the  initial  feasible  set  and  the  strategies  that  govern  the  transi¬ 
tion  from  the  hypothesis  formation  stage  to  hypothesis  evaluation  stage. 
The  identification  of  "the  initial  feasible  set,  the  transition  stra¬ 
tegies  and  the  analysis  of  the  experimental  results  are  discussed  next. 


Initial  Feasible  Set 

The  initial  feasible  set  (IFS)  consists  of  the  symptom-cause 
pairs  that  the  subjects  considers  before  any  hypothesis  is  formed  and 
evaluated.  It  reflects  the  strongest  symptom-cause  pairs  that  are  asso¬ 
ciated  with  the  given  obvious  symptoms. 


To  study  the  relationship  between  the  initial  feasible  set  and 
overall  performance,  an  operational  definition  and  measure  for  initial 
feasible  sets  were  necessary.  It  was  impractical  to  ask  the  subjects 
what  their  initial  feasible  sets  were.  An  operational  criterion  was  to 
take  the  first  three  actions  as  the  initial  feasible  set  if  no  stage 
shifting  occurred.  Stage  shifting  was  assumed  to  have  occurred  if  a 
subsystem  or  a  component  was  diagnosed  as  the  cause  of  failure.  If 


subjects  transitioned  to  the  hypothesis  evaluation  stage  before  their 
third  actions,  only  the  actions  before  transition  were  taken  as  the  ini¬ 
tial  feasible  set.  According  to  this  criterion,  members  of  an  initial 
feasible  set  might  be  a  component  check,  gauge  reading  or  alarm  status 
inquiry. 

If  the  initial  feasible  set  contained  components  that  were 
related  to  the  faulty  subsystem  or  component,  the  set  was  rated  as 
"good",  otherwise  "bad".  This  description  of  "good"  and  "bad"  is  used 
as  a  convenient  gross  measure  without  implying  any  measure  of  the  amount 
of  "goodness". 

Table  5-1  shows  examples  of  initial  feasible  sets  that  were 
formed  by  subjects  for  Failure  1-6.  Since  the  faulty  component  was  the 
gland  seal  regulator,  the  initial  feasible  sets  formed  by  SI,  S4  and  S5 
were  rated  as  "good"  and  those  by  S2,  S3  were  rated  as  "bad". 


Transition  Strategies 

Two  types  of  strategy  were  observed  to  result  in  transitions  from 
the  hypothesis  formation  stage  to  the  hypothesis  evaluation  stage.  They 
were  the  breadth-depth  strategy  and  the  balanced  strategy. 

(1).  Breadth-Depth  Strategy  (BD): 

Subjects  stayed  in  the  hypothesis  formation  stage  until  further 
symptoms  w  re  found  and  hypotheses  were  formed.  Then  they  switched 
into  the  hypothesis  evaluation  stage.  They  thoroughly  tested  these 
hypotheses  before  they  gave  up  and  went  back  to  the  hypothesis  for¬ 
mation  stage.  In  other  words,  they  conducted  a  broad  search  for 
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candidates  and  did  thorough  checking  once  a  hypothesis  was  formed. 
(2).  Balanced  Strategy  (BL): 

Subjects  did  not  conduct  a  broad  search  for  candidates  in  the 
hypothesis  formation  stage.  They  switched  to  the  hypothesis 
evaluation  stage  although  no  obvious  hypothesis  was  formed.  They 
did  not  seem  to  evaluate  any  single  hypothesis  thoroughly.  They 
tended  to  form  hypotheses  quickly  and  maintain  several  hypotheses 
at  the  same  time. 

To  analyze  the  data,  two  difficulties  had  to  be  resolved. 

(1) .  how  to  distinguish  the  hypothesis  formation  stage  from  hypothesis 

evaluation  stage? 

(2) .  how  to  identify  the  dominant  strategy? 

The  identification  of  stages  was  very  difficult.  The  following  criteria 
were  used: 

(1) .  If  more  than  two  actions  for  a  particular  subsystem  were  checked 

.onsecucivery ,  cnen  Lz  was  assumed  that  the  suojects  were  m  cne 
.lypothesu  evaluation  stage. 

(2) .  If  a  hypothesis  had  been  tested  before,  or  a  particular  component 

or  gauge  had  been  considered  before,  then  later  related  investiga¬ 
tions  were  considered  as  entering  the  hypothesis  evaluation  stage. 

(3) .  Actions  that  could  not  be  classified  using  the  two  criteria  above 

were  regarded  as  actions  in  the  hypothesis  formation  stage. 

Two  more  criteria  were  used  to  help  classify  the  strategy  as  balanced  or 
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breadth-depth. 

(4) .  If  multiple  hypotheses  were  observed,  then  more  weight  should  be 

placed  on  balanced  strategy. 

(5) .  If  each  hypothesis  evaluation  stage  consisted  of  more  than  three 

related  actions,  then  more  weight  should  be  placed  on  breadth-depth 
strategy. 

The  following  examples  illustrate  how  these  criteria  were  used  to 
identify  stages  and  hypotheses.  An  asterisk  before  an  action  denotes 
that  it  is  fault-related. 

Example  1 :  Bad  IFS  and  BL 

Failure  1-13  :  condenser  vacuum  low  alarm  sounds 
Subject  :  SI 
Actions  recorded  : 

1.  G60  Main  condenser  level  (IFS) 

2.  G36  Condensate  pump  pressure  (IFS) 

3.  G34  Salt  water  service  pressure  (IFS) 

4.  *  G27  Condenser  vacuum  pressure  (fault  related  action) 

5.  *  G35  Main  cooling  water  pressure  (fault  related  action) 

6.  D83  Sea  scoop  valve  (hypothesis  1:  circulating  water) 

7.  D78  Main  condenser  (hypothesis  2:  condenser) 

8.  D84  Sea  scoop  (hypothesis  1) 

9.  D80  Main  circulator  discharge  valves  (hypothesis  1) 

10.  G35  Main  cooling  water  pressure 

11.  G19  Deaerator  tank  pressure  (hypothesis  formation) 


12.  G55  LP  turbine  exhaust  temp  (hypothesis  formation) 

13.  G15  PORT  FO  to  burners  pressure  (hypothesis  formation) 

14.  15  Salinity  system  (hypothesis  formation) 

15.  G80  No.  1  Deaerating  feedtank  dump  regulator 

(hypothesis  3:  deaerator  tank) 

16.  D82  Main  circulator  suction  valve  (hypothesis  1) 

17.  D87  Sea  strainer  (hypothesis  1) 

Total  number  of  actions:  17 
Total  cost:  323 

In  example  1,  actions  1,  2  and  3  were  classified  as  the  initial 
feasible  set.  Since  none  of  them  were  related  to  the  cause  of  the 
failure,  this  IFS  was  rated  as  "bad".  Actions  4  and  5  were  additions  to 
the  feasible  set.  These  two  actions  were  related  to  the  fault  directly. 
Action  6  showed  the  first  hypothesis  which  was  related  to  circulating 
water.  Action  7  showed  another  hypothesis  which  was  concerned  with  the 
condenser  itself.  Actions  8  and  9  tested  the  first  hypothesis  further. 
From  action  10  to  action  14  the  subjects  switched  back  to  the  hypothesis 
formation  stage.  Action  15  was  related  to  action  11  and  was  thus 
regarded  as  the  third  hypothesis  which  dealt  with  the  deaerating  tank. 
In  actions  16  and  17,  the  subject  picked  up  the  first  hypothesis  again 
and  obtained  the  solution.  The  subject  maintained  three  hypotheses  at 
the  same  time.  He  alternated  among  hypotheses.  Therefore,  it  was  clas¬ 
sified  as  using  a  balanced  strategy. 

Example  2  :  Good  IFS  and  BD 

Failure  1-13  :  condenser  vacuum  low  alarm  sounds 
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Subject  :  S2 
Actions  recorded  : 

1.  *  G35  Main  cooling  water  pressure  (IFS) 

2.  D81  Main  circulators  (hypothesis  1:  cooling  water) 

3.  RO  Recall 

4.  D83  Sea  scoop  valve  (hypothesis  1) 

5.  D85  Sea  high  suction  (hypothesis  1) 

6.  D86  Sea  low  suction  (hypothesis  1) 

7.  D87  Sea  strainer  (hypothesis  1) 

Total  number  of  actions:  7 

Total  cost:  253 

In  example  2,  the  initial  feasible  set  contained  only  one  member 
which  was  related  to  the  cause  of  the  failure.  Therefore  the  IFS  was 
rated  as  "good".  From  action  2  onwards,  the  subject  formed  a  hypothesis 
about  the  cooling  water  system  and  conducted  thorough  checking  on  it 
until  action  7  when  the  solution  was  found.  The  subject  maintrined  only 
one  hypothesis  ana  did  thorough  checking  on  that  hypothesis.  It  was 
thus  classified  as  using  a  ireadth-depth  strategy. 

Analysis  of  Experimental  Results 

Of  29  failures,  21  were  considered  for  detailed  analysis.  Among 
the  failures  not  considered,  five  were  used  for  training  (1,  2,  3,  27, 
and  29),  two  resulted  in  confusion  due  to  ambiguous  symptoms  (18,  23), 

and  one  was  merely  a  duplicate  (28)  of  another  failure  (13). 


Twenty-eight  marine  engineers  participated  as  subjects  in  five 


experimental  sessions.  Since  the  experimental  conditions  were  very 
similar,  and  the  same  set  of  failures  was  used  throughout  all  experi¬ 
ments,  data  were  analyzed  across  the  subjects  from  different  experi¬ 
ments. 

The  analyses  were  carried  out  along  four  different  directions. 
First,  the  failures  were  classified  into  different  difficulty  levels. 
Second,  performance  between  subjects  using  good  IFS  and  bad  IFS  was  com¬ 
pared.  Third,  performance  between  subjects  using  BD  strategy  and  BL 
strategy  was  compared.  Finally,  performance  from  subjects  using  both 
good  IFS  and  BD  strategy  was  compared  with  the  rest.  These  are 
described  below  in  detail. 

Easy  vs.  difficult  failures 

Means  and  standard  deviations  of  number  of  actions  and  time  were 
computed  for  each  failure.  The  failures  are  plotted  using  means  of  the 
total  number  of  actions  and  time  in  Figure  5-5.  Three  distinct  clusters 
can  be  seen.  Failures  that  required  more  than  '00  seconds  were  classi¬ 
fied  as  aifficult.  Of  the  remaining  failures,  those  requiring  more  than 
10  actions  were  classified  as  moderately  difficult;  the  rest  were  clas¬ 
sified  as  easy.  Cluster  1  contains  Failures  7,  8,  9,  10,  and  19.  Clus¬ 
ter  2  consists  of  Failures  16,  20,  21  and  26.  The  remaining  failures, 
e.g.,  Failures  4,  5,  6,  11,  12,  13,  14,  15,  17,  22,  24,  and  25  are  in 
cluster  3. 

The  easy  failures  were  not  considered  for  further  analyses 
because  the  diagnostic  processes  for  easy  failures  were  too  short  to  be 
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Figure  5-5:  Performance  based  on  total  time  spent  and 
total  number  of  actions. 

Cluster  1:  7,  8,  9,  10,  19 
2:  16,  20,  21,  26 
3:  4,  5,  6,  11,  12,  13,  14 
15,  17,  22,  24,  25 


informative 


Table  5-2  lists  the  statistics  for  each  failure  under  both  meas¬ 
ures.  Two  observations  are  relevant:  large  standard  deviations  and 
varying  sample  sizes.  The  varying  sample  size  reflects  failures  where 


Table  5-2:  Performance  statistics. 


Failure  Diffi-  Sample  No.  of  Actions  Total  Time  (sec) 


no. 

culty 

size 

mean 

sd 

mean 

sd 

4 

28 

6.0 

3.1 

225 

178 

5 

28 

4.3 

2.6 

167 

126 

6 

27 

1.9 

1.5 

82 

73 

11 

25 

5.8 

6.2 

182 

173 

12 

21 

9.0 

8.2 

297 

272 

13 

Easy 

27 

6.6 

7.5 

295 

416 

14 

25 

4.6 

3.6 

163 

102 

15 

26 

5.2 

3.8 

195 

128 

17 

26 

4.3 

5.2 

215 

303 

22 

27 

2.7 

3.5 

101 

101 

24 

28 

3.2 

2.3 

84 

49 

25 

28 

2.5 

1.2 

91 

56 

16 

19 

12.5 

9.5 

445 

356 

20 

Mode¬ 

23 

12.2 

12.1 

499 

464 

21 

rate 

26 

10.7 

8.5 

318 

273 

26 

26 

11.4 

7.1 

484 

484 

7 

25 

21.3 

14.6 

1069 

641 

8 

Diffi¬ 

19 

12.7 

11.0 

893 

1037 

9 

cult 

20 

19.5 

14.7 

748 

780 

10 

17 

14.8 

13.6 

753 

1001 

19 

17 

19.9 

15.5 

1106 

1296 
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Che  subjects  used  Che  "Quit"  command  to  discontinue  the  diagnostic  pro¬ 
cess  without  finding  the  failed  component.  Since  data  from  such  trials 
were  not  used  for  analysis,  the  sample  sizes  were  reduced.  Difficult 
failures  had  smaller  sample  sizes  than  moderately  difficult  or  easy 
failures. 

Large  standard  deviations  in  performance  measures  could  be  the 
result  of  computing  overall  statistics  without  regard  to  the  IFS  and/or 
stage  shifting  strategies.  The  influence  of  these  two  factors  is  inves¬ 
tigated  next. 

Good  IFS  vs ♦  bad  IFS 

Subjects  were  divided  into  two  groups:  one  with  good  IFS,  the 
other  with  bad  IFS.  Statistics  were  computed  for  each  failure  from 
clusters  1  and  2  based  on  the  raw  data.  In  addition,  a  ranked  t-test 
was  also  done  based  on  a  paper  by  Conover  and  Iman  (1976).  Table  5-3 
summarizes  the  results. 

It  t3  clear  chat  suojects  with  good  IFS  performed  better  cnan 
with  bad  IFS.  12  out  of  the  IS  t-tests  taken  from  raw  iata  are  signifi¬ 
cant  at  0.05  level.  The  statistics  from  ranked  data  are  even  better,  17 
out  of  the  18  t-tests  are  significant  at  0.05  level. 

BD  strategy  vs.  BL  strategy 

Instead  of  classifying  subjects  by  the  IFS,  they  were  divided 
into  two  groups  according  to  the  strategies  used.  Statistics  from  the 
raw  data  as  well  as  ranked  data  were  computed  and  t-tests  were  done 


between  BD  strategy  group  and  BL  strategy  group.  Table  5-4  summarizes 
the  results.  Results  were  similar  to  what  was  observed  above. 

Subjects  using  BD  strategy  performed  better  than  those  using  BL 
strategy.  17  out  of  18  t-tests  from  raw  data  are  significant  at  0.05 
level,  while  all  t-tests  from  ranked  data  are  significant  at  0.05  level. 

Good  IFS  and  BD  strateev  vs .  all  the  others 

Subjects  were  divided  into  two  groups  for  testing:  one  with  both 
good  IFS  and  BD  strategy,  the  other  the  remaining  subjects.  The  data 
were  analyzed  as  before.  Table  5-5  summarizes  the  results.  It  is  seen 
that  the  group  with  good  IFS  and  BD  strategy  performed  better  than  the 
others . 


Table  5-3:  Performance  comparison  between  good  IFS  and  bad  IFS.  A: 

number  of  actions.  T:  Time.  designates  a  t-test  for  the 

means  that  was  not  significant  at  0.05  level. 


Failure 

data 

good  IFS 

bad 

IFS 

t-test 

no. 

type 

mean 

sd 

N 

mean 

sd 

N 

t-score 

V 

16 

raw  A 

7 

3.9 

10 

18.6 

10.4 

9 

3.16 

10 

T 

265 

212 

646 

385 

2.63 

13 

ranked  A 

13.3 

4.2 

5.3 

3.5 

4.50 

19 

T 

12.5 

4.7 

7.1 

5.3 

2.33 

18 

20 

raw  A 

9.6 

11.3 

18 

21.8 

10.6 

5 

2.25 

8 

T 

398 

453 

862 

320 

2.60 

11 

ranked  A 

12.3 

5.4 

6.4 

3.9 

2.76 

11 

T 

13.6 

6.7 

6.2 

3.4 

3.35 

17 

21 

raw  A 

7.9 

6.1 

21 

22.2 

7.8 

5 

3.82 

6 

T 

243 

133 

637 

472 

1.85* 

4 

ranked  A 

15.1 

6.4 

4.4 

3.5 

5.10 

14 

T 

15.1 

7.0 

6.2 

6.9 

2.60 

7 

26 

raw  A 

8.2 

4.8 

13 

14.7 

7.7 

13 

2.59 

21 

T 

472 

500 

495 

487 

.12* 

26 

ranked  A 

16.6 

7.1 

9.3 

6.1 

2.77 

25 

T 

14.5 

8.5 

12.5 

6.9 

.66* 

25 

7 

raw  A 

11.7 

17.5 

6 

24.4 

12.6 

19 

1.65* 

7 

T 

613 

583 

1213 

603 

2.18 

10 

ranked  A 

19.2 

8.6 

10.8 

5.7 

2.25 

7 

T 

18.3 

6.7 

11.3 

6.9 

2.21 

10 

8 

raw  A 

5.1 

3.2 

9 

19.5 

11.1 

10 

3.93 

11 

T 

285 

231 

1441 

1183 

3.03 

10 

ranked  A 

14.1 

3.6 

5.9 

3.5 

5.03 

19 

T 

13.4 

4.0 

6.8 

4.9 

3.22 

19 

9 

raw  A 

12.8 

10.0 

13 

31.9 

14.4 

7 

3.11 

10 

T 

429 

343 

1341 

1029 

2.28 

7 

ranked  A 

13 

5.1 

5.4 

3.5 

3.90 

19 

T 

13.2 

5.1 

5.4 

3.6 

3.98 

19 

10 

raw  A 

12.1 

10.6 

15 

35 

21.2 

2 

1.50* 

1 

T 

538 

632 

2370 

2125 

1.21* 

1 

ranked  A 

9.6 

4.7 

3 

2.8 

2.82 

3 

T 

9.8 

4.8 

3 

2.8 

2.90 

4 

19 

raw  A 

3.3 

.6 

3 

23.4 

14.8 

14 

5.07 

13 

T 

77 

15 

1326 

1331 

3.51 

13 

ranked  A 

15.7 

.6 

7.4 

4.0 

7.43 

15 

Table 


Failure 

no. 

16 


20 


21 


26 


7 


8 


9 


10 


-4:  Performance  comparison  between  BD  strategy  and  BL  strategy. 
A:  number  of  actions.  T:  Time.  designates  a  t-test 

for  the  means  that  was  not  significant  at  0.05  level. 


data 

BD  strategy 

BL  strategy 

t-test 

type 

mean 

sd 

N 

mean 

sd 

N 

t-score 

V 

raw 

A 

7.3 

3.3 

12 

21.4 

10.1 

7 

3.59 

7 

T 

251 

214 

779 

301 

4.09 

11 

ranked 

A 

13 

3.7 

3.6 

1.6 

7.67 

18 

T 

12.9 

4.7 

4.9 

2.4 

4.96 

19 

raw 

A 

5.5 

9.3 

14 

22.7 

7.7 

9 

4.81 

22 

T 

247 

371 

891 

294 

4.62 

22 

ranked 

A 

14.4 

4.2 

5.8 

2.9 

5.88 

23 

T 

15.6 

5.9 

6.2 

3.2 

4.98 

22 

raw 

A 

7.6 

4.8 

22 

27.3 

2.2 

4 

12.89 

13 

T 

242 

130 

741 

473 

2.10* 

3 

ranked 

A 

15.0 

6.2 

2.5 

1.3 

8.52 

26 

T 

15.2 

6.9 

3.5 

2.7 

5.91 

17 

raw 

A 

8.2 

4.7 

15 

15.9 

7.6 

11 

3.00 

17 

T 

237 

109 

821 

593 

3.22 

11 

ranked 

A 

16.4 

6.9 

8.2 

5.6 

3.35 

26 

T 

17.4 

5.7 

8.1 

6.8 

3.71 

21 

raw 

A 

13.9 

8.0 

18 

40.3 

9.5 

7 

6.51 

11 

T 

789 

517 

1787 

230 

6.67 

25 

ranked 

A 

16.1 

5.5 

4.1 

2.4 

7.57 

25 

T 

16.1 

6.3 

5.14 

2.4 

6.30 

25 

raw 

A 

8.1 

7.1 

14 

25.6 

9.8 

5 

3.67 

6 

T 

384 

364 

2320 

976 

4.33 

5 

ranked 

A 

12 

4.4 

3.6 

2.4 

5.28 

16 

T 

12.3 

4.4 

3.4 

2.3 

5.72 

17 

raw 

A 

14.1 

9.5 

16 

41 

11.9 

4 

4.21 

5 

T 

471 

324 

1857 

1129 

2.43 

3 

ranked 

A 

12.2 

5.0 

3 

1.9 

5.97 

18 

T 

12.4 

5.0 

3 

2.2 

5.70 

16 

raw 

A 

9.3 

8.0 

13 

32.5 

13.2 

4 

3.32 

4 

T 

317 

242 

2172 

1265 

2.92 

3 

ranked 

A 

10.6 

4.2 

3 

1.8 

5.18 

16 

T 

10.9 

4.0 

2.8 

1.7 

5.81 

16 

raw 

A 

12.6 

10.5 

11 

33.3 

14.5 

6 

3.09 

9 

T 

409 

398 

2383 

1425 

3.32 

6 

ranked 

A 

11.3 

4.0 

4.3 

2.6 

4.35 

17 

T 

11.7 

3.8 

4 

2.6 

4.96 

16 

19 


Table  5-5:  Performance  comparison  between  good  IFS  and  BD  and  all  oth 
ers.  A:  number  of  actions.  T:  Time.  :  designates  a  t 

test  for  the  means  that  was  not  significant  at  0.05  level. 


Failure 

data 

BD  &  good 

IFS 

the 

rest 

t-test 

no. 

type 

mean 

sd 

N 

mean 

sd 

N 

t-score 

V 

16 

raw  A 

6.1 

2.8 

9 

18.2 

9.8 

10 

3.72 

11 

T 

210 

129 

658 

365 

3.64 

12 

ranked  A 

14.2 

3.3 

5.3 

3.3 

5.90 

19 

T 

13.6 

3.5 

6.7 

5.2 

3.41 

18 

20 

raw  A 

2.8 

2.9 

12 

22.6 

9.3 

11 

6.76 

12 

T 

128 

139 

904 

327 

7.30 

14 

ranked  A 

15.7 

2.0 

6 

3.5 

8.09 

17 

T 

17.3 

3.9 

6.1 

3.5 

7.29 

23 

21 

raw  A 

7 

4.6 

20 

22.8 

7.2 

6 

5.10 

7 

T 

236 

133 

592 

436 

1.97 

5 

ranked  A 

15.7 

5.9 

4.2 

3.2 

6.23 

19 

T 

15.5 

6.9 

6.3 

6.2 

3.11 

11 

26 

raw  A 

4.2 

1.9 

6 

13.6 

6.7 

20 

5.60 

26 

T 

183 

105 

574 

517 

3.17 

24 

ranked  A 

22.8 

2.7 

10.0 

5.6 

7.69 

22 

T 

20.2 

5.3 

11.5 

7.1 

3.25 

13 

7 

raw  A 

4.6 

3.0 

5 

25.5 

13.3 

20 

6.44 

25 

T 

403 

308 

1235 

595 

4.35 

16 

ranked  A 

22.6 

2.0 

10.3 

5.9 

7.83 

24 

T 

20.6 

4.3 

11.1 

6.8 

3.90 

12 

8 

raw  A 

4.3 

2.0 

8 

18.8 

10.8 

11 

4.39 

11 

T* 

227 

161 

1373 

11*2 

3.30 

1  1 
*.  ± 

ranked  A 

15 

2.o 

6 

7  7 

6  .oO 

19 

T 

14.3 

3.5 

6.3 

4.7 

3.98 

19 

9 

raw  A 

12.9 

10.0 

13 

31.9 

14.4 

7 

3.11 

10 

T 

429 

343 

1341 

1029 

2.28 

7 

ranked  A 

13 

5.1 

5.4 

3.5 

3.90 

19 

T 

13.2 

5.1 

5.4 

3.6 

3.98 

19 

10 

raw  A 

9.3 

3.0 

13 

32.5 

13.2 

4 

3.32 

4 

T 

317 

242 

2172 

1265 

2.92 

3 

ranked  A 

10.6 

4.2 

3 

1.8 

5.18 

16 

T 

Ji  .0 

4.0 

2.8 

1.7 

5.81 

16 

19 

raw  A 

3.3 

.6 

3 

23.4 

14.8 

14 

5.07 

13 

T 

77 

15 

1326 

1331 

3.51 

13 

ranked  A 

15.7 

.6 

7.4 

4.0 

7.43 

15 
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Discussion 

Based  on  the  experimental  results,  it  appears  that  the  initial 
feasible  set  and  the  transition  strategies  greatly  affect  the  diagnostic 
performance  on  a  low  fidelity  simulator  of  a  marine  engine  control  room. 
The  initial  feasible  set  reflects  the  knowledge  base  of  the  subjects. 
The  better  the  initial  feasible  sets  are,  the  better  is  the  performance. 
Although  it  is  not  clear  how  the  initial  feasible  sets  are  formed,  it  is 
suspected  that  this  may  be  affected  by  the  knowledge  of  the  system  and 
the  structure  of  the  internal  (mental)  model  linking  the  symptom-cause 
pairs . 


However,  a  good  initial  feasible  set  in  itself  does  not  seem  suf¬ 
ficient  to  explain  the  observed  performance.  Good  problem  solving  stra¬ 
tegy  also  plays  an  important  role.  It  was  found  that  a  breadth-depth 
strategy  was  more  efficient  than  a  balanced  strategy.  It  is  worth  not¬ 
ing  that  strategies  are  relatively  context-free,  while  formation  of  the 
initial  feasible  set  is  context-specific. 

The  fact  that  both  initial  feasible  set  and  the  strategy  used 
affected  the  performance  is  very  significant.  It  seems  to  imply  that  in 
a  highly  specialized  field,  problem  solving  ability  is  affected  by  both 
context-free  and  context-specific  factors.  This  agrees  with  the  results 
and  models  of  Rouse  (1983),  and  Rouse  and  Hunt  (1984). 


Summary  and  Conclusions 


Several  interesting  observations  were  made  during  and  after  the 
FAIL  experiments  at  MSI.  The  marine  engineers  who  participated  as  sub¬ 
jects  were  highly  enthusiastic  about  their  interaction  with  FAIL.  Based 
on  their  comments,  and  observations  during  the  experiments  it  is 
apparent  that  FAIL  has  two  major  benefits  as  a  training  tool  for  a  com¬ 
plex  system.  They  are: 


(1)  FAIL  associates  causes  with  symptoms 

In  real  life,  the  pairing  of  causes  and  symptoms  is  a  matter  of 
experience.  FAIL  condensed  these  experiences  into  an  interactive 
computer  course  which  is  easy  to  use  and  implement.  This  type  of 
association  between  causes  and  symptoms  refreshes  their  memory 
about  failures  and  sometimes  updates  their  knowledge  about  the  sys- 


(2)  FAIL  forces  them  to  think 

Unfamiliar  failures  lead  them  into  reasoning  process  which  they  do 
not  usually  undergo  in  their  day-to-day  operations.  The  reasoning 
process  forces  them  to  reorganize  or  restructure  their  knowledge 
about  the  system. 


It  would  be  desirable  to  explore  these  issues  via  rigorous 
transfer  of  training  experiments  (Maddox, Johnson  and  Frey  1985).  How¬ 
ever,  no  plans  for  such  experiments  exist  at  this  time  due  to  the  diffi¬ 
culties  involved  in  evaluating  the  trainees'  real  world  job  performance 
on  a  long  term  basis.  Similar  difficulties  have  been  reported  by  Adams 
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(1979)  and  Blaiwes,  Puig  and  Regan  (1973). 

The  feedback  provided  by  the  trainees  corroborates  the  experimen¬ 
tal  findings  to  a  great  degree.  The  strong  relationship  between  IFS  and 
overall  performance  emphasizes  the  important  role  of  system  knowledge  on 
diagnosis.  Frequent  associations  between  symptoms  and  causes  may  result 
in  the  trainees'  developing  better  IFS.  Better  IFS  should  lead  to 
better  performance. 

The  reason  why  IFS  is  highly  correlated  with  overall  performance 
can  perhaps  be  explained  by  the  anchoring  effect  (Tversky  1974).  Humans 
have  a  tendency  or  bias  toward  the  initial  thought  which  may  create 
"cognitive  tunnel  vision"  (Sheridan  1981).  Therefore,  the  better  the 
IFS  is,  the  better  is  the  performance. 

The  implications  of  these  results  are  twofold.  First,  diagnostic 
training  should  emphasize  the  pairing  of  symptoms  and  causes.  Second, 
it  should  try  to  discourage  trainees  from  making  wrong  hypotheses  in  the 
early  stages  of  a  diagnostic  task.  The  first  is  somewhat  obvious  and 
has  been  carried  out  in  most  training  programs.  However,  it  is  not 
obvious  how  the  latter  can  be  incorporated  into  a  training  program  and 
further  investigation  is  needed. 

Although  trainees  were  not  aware  of  the  existence  of  stage- 
shifting,  they  felt  that  more  thinking  and  planning  could  help  improve 
the  diagnostic  process.  Subjects  who  used  BL  strategy  had  a  tendency  to 
generate  a  large  set  of  plausible  hypotheses  about  failed  components 
that  could  have  produced  the  observed  symptoms.  However,  humans  are 
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observed  to  have  difficulties  in  generating  all  plausible  hypotheses  in 
diagnosing  (Mehle  1982).  Also,  due  to  short  term  memory  limitations 
(Miller  1956),  and  other  cognitive  limitations  in  keeping  track  of  more 
than  one  line  of  reasoning  at  any  time,  it  is  extremely  difficult  to 
hold  several  active  hypotheses  simultaneously.  Maintaining  multiple 
hypotheses  generates  confusion  and  heavy  cognitive  load,  which  reduces 
efficiency. 

The  relationship  observed  between  transition  strategy  and  overall 
performance  indicates  that  context-free  problem  solving  training  is  very 
useful.  In  fact,  some  trainees  reported  that  they  benefitted  from  TASK 
(a  context-free  logical  troubleshooting  environment)  and  FAULT.  TASK 
and  FAULT  were  administered  along  with  FAIL  to  improve  skills  in  logical 
inference.  The  implication  of  this  for  technical  training  is  that  a 
training  program  that  encourages  more  thinking  and  planning  rather  than 
disorganized  diagnostic  behavior  should  be  designed  if  at  all  possible. 

Two  important  observations  can  be  made  concerning  future 
researcn.  The  first  is  now  dynamics  of  a  system  affects  tne  diagnostic 
process.  An  important  difference  between  a  static  system  and  a  dynamic 
system  is  that  the  latter  can  get  worse  as  time  goes  by  since  inap¬ 
propriate  actions  might  be  taken  by  the  subject.  The  framework  might 
need  to  be  modified  to  account  for  this  phenomenon. 

The  second  concerns  individual  differences  among  the  experts. 
Although  the  trainees  have  very  similar  training  on  this  particular  sys¬ 
tem,  they  responded  differently  when  presented  with  failures.  It  is 
conjectured  that  the  way  their  experiences  are  organized  and  integrated 


with  system  knowledge  might  be  responsible  for  the  differences.  In 
other  words,  the  process  in  which  the  knowledge  is  organized  might  play 
an  important  role  in  how  this  knowledge  is  used.  This  involves  the 
acquisition,  structuring,  and  retrieval  of  knowledge.  These  three 
aspects  are  highly  interdependent  (Baddeley  1976)  and,  hence,  none  of 
them  can  be  considered  individually  to  the  exclusion  of  the  others.  A 
comprehensive  model  could  possibly  be  developed  based  on  research  on 
knowledge  acquisition  and  knowledge  application  discussed  below.  These 
issues  and  those  discussed  below  are  considered  in  greater  detail  in 
chapter  VI. 

Knowledge  acquisition  is  concerned  with  how  experiences  help 
build  up  the  knowledge  base.  It  emphasizes  the  learning  process  and  the 
structuring  of  knowledge  in  memory.  The  dynamic  memory  model  of  Schank 
(1982)  and  the  long  term  memory  model  of  Kolodner  (1983)  might  prove 
useful  to  study  this  problem. 

Knowledge  application  deals  with  the  process  of  using  knowledge 
to  make  appropriate  inferences.  Conventional  models  of  human  problem 
solving  assume  that  knowledge  is  organized  into  situation-action  ruies. 
Thus  knowledge  application  is  a  matter  of  matching  situations  to 
actions.  A  rule-based  model  such  as  MYCIN's  (a  medical  diagnostic  pro¬ 
gram)  production  system  (Shortliffe  1976)  is  a  typical  example.  How¬ 
ever,  a  rule-based  model  is  rather  weak  in  explaining  the  inferential 
processes  involved  in  diagnosing.  Davis  (1983)  observed  that  sometimes 
the  diagnostic  reasoning  may  come  "from  first  principles",  i.e.,  from  an 
understanding  of  causality  of  the  device  being  examined.  Several 


researchers  incorporate  this  approach  into  the  knowledge  structure  such 
that  more  sophisticated  application  of  knowledge  can  be  explained. 
Chandrasekaran  and  Mittal  (1983)  proposed  a  diagnostic  problem-solving 
structure  which  has  all  the  aspects  of  the  underlying  knowledge  needed 
for  diagnostic  reasoning  "compiled"  into  it.  de  Kleer  and  Brown  (1983) 
studied  mental  models  of  devices  and  tried  to  derive  inferences  from 
those  models.  These  approaches  try  to  organize  the  relevant  system 
knowledge  in  specific  ways  so  as  to  aid  the  inferential  process. 
Research  along  these  lines  is  expected  to  benefit  the  development  of 
intelligent  computer  aids  for  training.  Chapter  VI  describes  a  modeling 
effort  towards  this  goal. 


CHAPTER  VI 


A  MODEL  OF  FAULT  DIAGNOSIS  PERFORMANCE 

From  the  experimental  results  discussed  in  the  preceeding 
chapters  it  was  seen  that  there  was  significant  difference  in  the 
sequences  of  actions  taken  by  subjects.  This  could  possibly  be  the 
result  of  individual  differences.  Everything  inexplicable  by  the  pro¬ 
posed  hypotheses  or  the  central  tendency  could  perhaps  be  classified  and 
then  discarded  as  individual  differences.  This  may  be  adequate  if  the 
central  tendency  is  overwhelming.  However,  since  there  are  no  obvious 
patterns  and  a  strong  central  tendency  is  absent,  a  different  explana¬ 
tion  is  needed.  The  modelling  approach  discussed  in  this  chapter  pro¬ 
vides  such  an  explanation. 

Models  of  fault  diagnosis  performance  can  be  classified  into  two 
types:  macro  and  micro.  Macro  models  describe  performance  strategies  or 
behavior  patterns.  They  describe  general  problem-solving  rules  that  are 
abstracted  from  observations  of  subjects'  diagnostic  behavior.  A  model 
proposed  by  Rouse  and  Hunt  (1984)  that  uses  topographical  rules  (T- 
rules)  and  symptomatic  rules  (S-rules)  is  a  typical  example.  These 
rules  were  employed  successfully  to  classify  diagnostic  behavior.  They 
explained  "what"  has  been  performed,  but  not  "how"  and  "why"  a  particu¬ 
lar  action  was  chosen  or  formed.  Rouse  and  Hunt  also  proposed  a  fuzzy 
set  approach  to  explain  how  a  particular  rule  is  chosen  in  terms  of  the 
intersection  of  fuzzy  sets  of  recalled,  applicable,  useful,  and  simple 
rules.  However,  no  effort  was  made  to  explore  how  system  knowledge  was 


aquired  or  why  different  weights  were  associated  with  different  alterna¬ 
tives  . 

Contrasted  with  macro  models,  micro  models  consider  individual 
differences.  Therefore,  a  micro  model  should  specify  the  formation  of 
knowledge  and  the  mechanisms  behind  actions.  Most  AI  models  can  be 
classified  as  micro  models  since  they  usually  deal  with  the  knowledge 
representation  and  retrieval  problems  in  a  precise  way.  For  example, 
Schank's  (1982)  approach  to  Memory  Organization  Package  (MOP)  not  only 
specifies  how  memory  is  organized,  but  also  describes  the  mechanism  of 
reminding.  Kolodner  (1983a,  1983b)  proposed  a  computer  model  of  long 
term  memory.  Later  she  applied  this  model  to  medical  diagnosis 
(Kolodner  and  Kolodner  1984)  which  demonstrated  experience's  role  in 
acquiring  expertise.  She  showed  how  medical  knowledge  is  learned,  modi¬ 
fied  and  generalized.  Medical  diagnosis  differs  from  engineering  diag¬ 
nosis  in  one  important  characteristic:  medical  knowledge  about  how  the 
body  functions  is  less  structured  than  engineering  knowledge  about  how  a 
system  works.  Therefore,  there  is  no  clearly  defined  model  to  explain 
Che  patient' 3  body  function.  An  engineering  diagnosis  task  is  defin¬ 
itely  better  defined  and  structured  than  a  psychiatric  diagnosis. 

In  this  chapter,  a  micro  model  is  proposed  that  deals  with 
knowledge  formation  and  the  mechanism  of  fault  diagnosis  behavior  in  a 
marine  powerplant.  Such  a  model  could  help  identify  essential  com¬ 
ponents  of  fault  diagnosis  behavior  and  hence  choose  appropriate  levels 
of  simulator  fidelity  for  training.  A  micro  model  might  also  reveal  the 
limitations  of  human  operators,  and  help  develop  better  aiding  schemes. 
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Kuipers  and  Kassirer  (1984)  pointed  out  that  in  building  an 
intelligent  computer  system  which  deals  with  causal  reasoning,  two  types 
of  constraint  have  to  be  considered:  computational  constraints  on 
knowledge  representation,  and  empirical  constraints  on  expert's  reason¬ 
ing  process.  Computational  constraints  require  that  a  knowledge 
representation  be  computer  implementable .  Empirical  constraints  may 
direct  the  formation  of  the  knowledge  representation  in  a  format  that 
reflects  how  experts  solve  problems. 

The  model  discussed  below  is  based  on  observations  of  real  fault 
diagnosis  behavior.  It  tries  to  incorporate  all  data  and  protocols  into 
a  coherent  model.  The  model  considers  empirical,  constraints  only.  We 
discuss  two  types  of  knowledge  and  their  structures.  Then  the  charac¬ 
teristics  of  diagnostic  behavior  on  the  marine  powerplant  are  summar¬ 
ized.  A  conceptual  entity  called  an  "hypothesis  frame"  is  employed  to 
account  for  the  observed  behaviors.  Finally,  an  integrated  diagnosis 
model  taking  into  consideration  these  two  types  of  knowledge  structures 
and  the  hypothesis  frame  is  described. 

This  model  is  noc  intended  to  be  a  robust  ttieory  of  diagnosis. 
As  noted  by  Williams  et  al.,  (1983),  in  their  paper  describing  mental 
models  of  a  simple  physical  system, 

"We  do  not  claim  that  the  models  represent  a  theory  of  reasoning 
nor  the  simulations  a  test  of  a  theory.  Rather,  the  models  are 
employed  as  tools  to  help  us  structure  our  description  of  the  sub¬ 
jects'  behavior."  (pp.  136) 

This  chapter  tries  to  structure  the  observations  from  subjects'  behavior 


in  real  diagnosis  tasks. 


Knowledge  Representation 

Based  on  observations  and  protocols  (see  Appendix  C)  taken  from 
the  experiments,  two  types  of  knowledge  were  identified:  symptom 
knowledge  and  system  knowledge. 

Symptom  Knowledge 

Troubleshooters  accumulate  handy  rules  through  years  of  experi¬ 
ence.  These  rules  associate  obvious  symptoms  with  non-obvious  symptoms 
or  causes.  In  the  experiments  some  actions  were  generated  fairly 
quickly,  suggesting  a  direct  association  between  symptoms  and  causes. 
For  example,  given  "fluctuation  of  fuel  oil  pressure,"  most  engineers 
immediately  diagnosed  that  there  was  water  in  the  fuel. 

Theoretically,  all  symptoms  can  be  derived  mentally,  given  an 
appropriate  representation  of  the  system  dynamics.  However,  some  symptom 
derivation  processes  may  be  slow  and  error-prone  due  to  human  cognitive 
limitations.  Symptoms  that  indicate  environmental  changes  are  good 
examples.  Knowing  the  dynamics  of  the  system  might  not  help  distinguish 
the  causes  for  noises  of  different  intensity  and  frequency.  Experience 
plays  an  important  role  in  associating  causes  with  environmental  symp¬ 
toms  such  as  noise,  vibration,  smoke,  pressure  fluctuation,  etc. 


The  direct  association  between  symptoms  and  causes  can  be 
represented  with  production  rules  in  the  IF-THEN  format.  When  diagnos- 
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ing  a  system  failure,  subjects  match  the  known  symptoms  with  the 
antecedents  of  the  rules.  If  matches  are  found,  the  action  parts  of  the 
rules  will  be  triggered.  Causes  associated  with  a  particular  symptom 
seem  to  have  high  generality  across  subjects.  For  example,  rule  1  in 
Table  6-1  lists  the  most  popular  causes  that  are  associated  with  the 
symptom  "vacuum  low  at  reduced  speed".  Gland  seal  steam  was  referred  to 
by  20  out  of  27  subjects  in  experiment  two  through  six.  Cooling  water 
system  and  condensate  pump  system  were  referred  to  by  23  and  18  out  of 
28  subjects  respectively. 

There  are  three  characteristics  of  these  rules  that  make  this 
matching  process  efficient. 

1.  Rules  may  be  partitioned  into  groups  such  as  steam-related 
group,  noise-related  group  or  fuel-related  group,  etc.  This 
grouping  scheme  makes  the  rule-matching  process  efficient.  A 
few  examples  of  these  symptom-cause  rules  are  listed  in  Table 
6-1. 

2.  The  relationship  between  symptoms  and  causes  is  an  inverted 
tree  such  as  represented  in  Figure  5-5.  Symptom-cause  branches 
are  prioritized.  The  priorities  may  change  as  experience  accu¬ 
mulates  and  differs  a  lot  from  subject  to  subject.  The  priori¬ 
ties  are  used  to  guide  the  generation  and  evaluation  of 
hypotheses.  An  example  of  a  priority  list  can  be  found  in  the 


rules  of  Table  6-1  which  is  derived  from  the  data  collected. 


Table  6-1:  Examples  of  symptom-cause  rules.  The  actions  for 
a  given  condition  are  shown  in  decreasing  order 
of  priority. 


Rule  1: 

IF  vacuum  low  and  reducing  speed 

THEN  1.  check  gland  seal  regulator  failure  frame 

2.  check  circulating  water  failure  frame 

3.  check  the  condensate  pump  failure  frame 

Rule  2: 

IF  vacuum  low  at  full  speed 

THEN  1.  check  condensate  pump  failure  frame 

2.  check  vacuum  pump  failure  frame 

3.  check  condenser  failure  frame 

4.  check  circulating  water  failure  frame 

5.  check  gland  seal  regulator  failure  frame 


3.  Rules  with  a  large  number  of  conditions  to  satisfy  are  tighter 
than  those  with  a  small  number.  A  tighter  rule  is  usually  pre¬ 
ferred  because  it  means  a  smaller  and  more  accurate  searching 
space  for  possible  causes.  For  example,  rule  1  in  Table  6-1  is 
tighter  than  ruie  2  in  the  same  table.  Therefore,  rule  1  is 
preferred  if  both  conditions  are  satisfied. 


Another  type  of  symptom  knowledge  is  "facts"  about  the  specific 
system  being  diagnosed.  Facts  are  information  about  a  particular  sys¬ 
tem.  An  example  of  a  fact  is  "This  ship  has  been  notorious  on  her  atom- 


izing  steam  system".  Facts  describe  general  observations  of  a  particu¬ 
lar  system  without  clear  association  with  any  causes.  Facts  can  help 
reduce  the  effort  on  the  generation  and  evaluation  of  hypotheses.  How¬ 
ever,  these  are  very  system-specific.  The  symptom-cause  pairings  remain 
the  same  when  the  troubleshooter  is  switched  to  other  ships,  while  the 
facts  may  be  unique  for  a  particular  system. 

System  Knowledge 

An  engineering  system  is  relatively  well  defined.  Although  there 
was  no  clear  evidence  of  a  homogeneous  mental  model  about  the  system 
across  subjects,  there  appear  to  exist  hierarchical  models  of  subsystems 
and  components.  Knowledge  about  the  system  is  organized  at  several 
abstraction  levels.  In  general,  modules  at  higher  abstraction  levels 
are  more  function-oriented,  while  modules  at  lower  abstraction  levels 
have  clear  physical  correspondence  (Rasmussen  1985).  However,  at  lower 
levels,  the  relationships  are  not  entirely  hierarchical  since  certain 
components  and  subsystems  may  be  common  to  more  than  one  system  (Govin- 
daraj  1984).  For  example,  a  steam  system  module  may  be  defined  as  heat 
transfer  unit,  while  a  boiler  itself  may  be  defined  as  a  collection  of 
its  major  physical  components.  The  following  describes  part  of  a  possi¬ 
ble  hierarchical  model  of  the  powerplant. 

The  highest  abstraction  level  may  contain  only  the  major  subsys¬ 
tems  such  as  steam,  condenser,  fuel,  lube  oil,  auxiliary  steam  and  tur¬ 
bine  systems.  A  lower  abstraction  level  in  the  steam  system  may  contain 
the  furnace,  boiler,  economizer,  and  air  supply  system.  possible 


lower  abstraction  level  for  the  boiler  may  contain  the  superheater,  the 
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desuperheater,  the  attemperator ,  the  boiler  drum  and  the  boiler  tubes. 
An  even  lower  abstraction  level  for  the  superheater  may  contain  the 
first  pass  pipe,  the  second  pass  pipe,  the  third  pass  pipe,  the  fourth 
pass  pipe  and  the  fifth  pass  pipe.  A  hypothetical  hierarchical  tree  is 
shown  in  Figure  6-1. 

Each  module  in  this  hierarchical  tree  may  contain  three  types  of 
information:  definition,  function,  and  connections.  Table  6-2  shows  a 
hypothetical  boiler  system  module.  In  this  boiler  system  module,  there 
is  no  need  to  mention  the  details  of  the  furnace  system  such  as  how  the 
fuel  gets  mixed  with  the  air  or  the  role  of  air  registers,  etc.  If  in 
the  process  of  diagnosing,  more  details  about  any  submodules  in  the 
lower  abstraction  level  is  needed,  the  submodule  can  simply  be  accessed 
through  appropriate  links. 

The  definition  and  function  of  a  module  is  described  in  terms  of 
its  descendants  in  the  immediate  lower  abstraction  level  or  modules  on 
other  branches  of  the  tree.  Links  to  modules  of  other  branches  of  the 
tree  denote  the  connections  among  modules  with  different  functions.  For 
example,  the  condenser  may  be  functionally  classified  as  a  part  of  the 
cooling  system.  However,  there  is  a  connection  between  the  auxiliary 
steam  system  and  the  condenser  through  the  35  lb  steam  line.  This  con¬ 
nection  is  shown  on  Figure  6-1  by  the  dashed  line  connecting  the  con¬ 
denser  and  the  auxiliary  steam  system. 

When  subjects  were  asked  to  describe  how  the  system  works  in 
terms  of  higher  abstraction  levels,  almost  everyone  described  it  in  a 
similar  fashion.  This  indicated  that  subjects  at  least  had  a  consistent 


Table  6-2:  The  Boiler  System  Module 


Definition: 

A  boiler  system  transforms  the  heated  water  into  steam. 
Function: 

Feedwater  goes  through  economizer  to  be  heated  before  being 
fed  into  the  boiler. 

The  boiler  is  heated  by  the  furnace  which  mixes  air  and  fuel 
oil  for  burning. 

Connections : 

Inputs  are  feedwater  from  feed  system,  fuel  oil  from 
fuel  oil  supply,  and  air  from' air  supply. 

Outputs  are  steam  to  steam  system,  gas  and  smoke  to  stacks. 


view  about  the  powerplant  at  the  higher  abstraction  levels.  However, 
the  diagnostic  process  varied  from  subject  to  subject.  Here  are  some 
observations  about  the  diagnostic  process. 


Apparent  Characteristics  of  the  Diagnostic  Process 

1.  Diagnosis  proceeds  in  a  hierarchical  manner.  Given  symptoms,  sub¬ 
jects  start  reasoning  at  a  higher  abstraction  level  and  generate 
hypotheses  using  available  information.  Hypotheses  generated  in  a 
higher  level  set  the  direction  of  diagnosis  for  a  lower  abstraction 
level.  The  following  excerpt  from  a  protocol  showed  this  hierarchi- 
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cal  approach. 

L217:  Need  to  find  out  whether  there  is  too  much  demand  for  the 
boiler, 

L218:  Whether  the  boiler  pressure  is  dropping  in  the  auxiliary 
system  or  the  main  system. 

L219:  In  order  to  bring  the  pressure  back,  you  have  to  reduce 
the  load. 

L220:  We  have  to  come  back  to  a  reduced  speed, 

L221 :  To  secure  some  other  steam  demand  and  find  out  the  prob¬ 
lem. 

(Experimenter  intervention  to  keep  conversation  going) 

L223 :  You  have  to  check  to  see  if  both  burners  in  both  boilers 
are  operating  properly. 

L224:  If  the  fuel  oil  is  following  the  demand  and  the  air, 

L225:  If  there  is  not  enough  air  we  won't  get  proper  atomiza¬ 
tion.  I  mean  proper  burning. 

L226 :  Have  to  check  different  pressure. 

L227 :  Whether  the  temperature  the  boiler  is  OK 

L228:  To  see  if  it  would  have  dirty  tubes. 

The  subject  first  diagnosed  the  problem  in  terms  of  functional  con¬ 
cepts  like  "too  much  demand  on  the  boiler"  (L217).  Following  this 
diagnosis  of  a  high  abstraction  level,  the  subject  suggested  some 
remedial  actions  (L219,  L220,  L221).  Then  after  assuring  that  the 
boiler  was  not  overloaded,  the  subject  proceeded  to  diagnose  the 
problem  further  using  "too  much  demand  on  the  boiler"  as  a  guide- 
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line.  He  shifted  attention  to  a  gross  but  concrete  component 
"burner"  (L223).  To  identify  what  was  wrong  with  the  burners,  he 
moved  down  the  hierarchy  tree  to  the  fuel  line  and  air  line  inside  a 
burner  (L224,  L225).  At  this  moment,  he  checked  the  system  state  to 
find  out  the  status  of  the  fuel  and  air  lines  going  into  the 
burners.  They  were  alright;  the  subject  therefore  came  back  to  a 
higher  abstraction  level  to  resume  another  line  of  reasoning  (L226, 
L227). 

Diagnosis  proceeds  with  backward  and  forward  reasoning.  It  is  com¬ 
monly  known  that,  given  symptoms,  subjects  search  for  symptom-cause 
pairs  to  proceed  with  the  diagnostic  process.  This  is  a  backward 
reasoning  process  which  goes  from  symptoms  to  causes.  However,  it 
is  observed  that  subjects  also  use  forward  reasoning  to  help  formu¬ 
late  and  eliminate  hypotheses.  Forward  reasoning  process  allows  the 
subjects  to  reason  through  the  system  dynamics  over  time  starting 
from  a  component  of  interest.  For  example,  a  subject  was  observed 
to  reason  in  the  following  manner. 

L530:  If  cold  water  goes  into  the  feedpump,  it  would  cause  the 
feedpump  to  flash  the  steam,  and  the  pump  would  then  lose 
suction. 

In  this  case  the  subject  mentally  derived  how  the  system  would  react 
if  cold  water  goes  into  the  feedpump.  This  is  a  forward  reasoning 


process 


3.  The  symbolic  descriptions  of  quantities  are  stated  in  qualitative 
terms.  For  example,  high  pressure,  low  vacuum,  enough  flow  etc., 


are  used  to  describe  the  system  variables.  Therefore,  qualitative 
state  descriptions  seem  adequate. 

4.  Knowledge  appears  to  be  chunked:  relevant  symptoms,  inferences  and 
structures  are  grouped  together  mentally  for  easy  access.  Subjects 
showed  great  efficiency  in  evolving  through  system  states  qualita¬ 
tively.  For  example,  when  asked  about  what  would  happen  if  the  35 
lb  line  had  problems,  the  subject  responded  quickly  with  the  dialo¬ 
gue  from  line  L135  through  line  L138  in  Table  6-3. 

The  symptoms  (hearing  the  35  lb  make  up,  hearing  the  feedpump 
speed  up  a  little),  the  inferences  (from  35  lb  failure  to  the  make¬ 
up,  to  the  feedpump  state  change,  to  the  dumping  into  condenser), 
and  the  structures/connections  (the  relationships  between  35  lb 
line,  feedpump,  high  pressure  steam,  low  pressure  steam,  and  the 
condenser),  are  clearly  stated. 

Hypothesis  Frame 

These  four  observations  lead  to  the  construction  of  a  useful  con¬ 
ceptual  entity  called  hypothesis  frame.  A  frame  as  described  by  Minsky 
(1975)  is  a  general  way  of  representing  common  knowledge.  Frames  con¬ 
tain  information  about  many  aspects  of  the  objects  or  situations  they 
describe.  The  "hypothesis  frame"  or  prototype  has  been  used  for  organ¬ 
izing  disease  types  in  medical  diagnosis  research.  CENTAUR  (Aikins, 
1979)  and  PIP  (Szolovits  and  Pauker  1978)  used  "hypothesis  frame"  as  the 


Table  6-3:  Protocols  from  subject  S31  while  solving  Failure  19.  Lines 
beginning  with  "E"  were  interruptions  by  the  experimenter  to 
keep  the  onversation  going.  Lines  beginning  with  "L"  were 
uttered  by  S31. 

Failure  19  (Vacuum  drops  at  reduced  speed)  (S31) 

L100:  First  indication  is  a  lot  of  vacuum  loss. 

L101:  So,  first  thing  I'll  check  will  be  the  gland  seal 
L102:  (to  see)  if  it  has  pressure  after  regulator. 

L103:  You  know  on  the  board  it  is  at  least,  say,  1  or  2  pounds. 

L104:  If  that  is  OK,  I  believe,  you  check  circulator  maybe, 

L105:  to  see  that  you  have  enough  water  going  back  into  the  condenser 
L106:  to  keep  the  air  ejector  cool. 

E107 :  Why  the  air  ejector? 

L108:  Oh!  If  you  talk  about  the  vacuum  pump...  I  am  not  quite  familiar 
with  the  vacuum  pump.  Let  us  say  the  .... 

L109:  I  know  with  the  air  ejector  you  lose  vacuum  that  way. 

L110:  What  else  can  be  wrong? 

Elll:  So,  those  are  most  common  problems? 

L112:  Yes,  I  would  think  so,  unless  you... 

L113:  Assuming  the  condensate  pump  running  normally, 

L114:  that  will  be  the  first  indication,  the  major  cause. 

E115:  So,  what  else? 

L116:  Check  level  of  condenser. 

E117:  What  do  you  want  to  find? 

L118:  You  want  to  find  out  if  it  is  normal, 

L119:  if  it  is  too  high 

then...  you  see  it  could  be  getting  water  from  somewhere  else. 
E120:  What  do  you  mean  by  somewhere  else? 

L121:  Say,  for  example,  condensate  pump  may  not  be  functioning  properly 
E122:  Think  aloud. 

L123 :  It  would  oe  ...,  well. 

If  tne  level  is  OK,  then  you  look  for  somecnmg  eise. 

L124:  We  just  slow  down. 

L125:  You  assume  everything  else  is  OK. 

L126:  In  other  words,  circulating  water  is  fine, 

L127:  no  reason  for  that,  which  you  think  probably  will  be  ... 

L128:  I  don't  think  it  has  any  problem  with  the  bleeds 
L129:  I  don't  know.  The  symptom  is... 

E130:  Vacuum  dropping  when  slowing  down  to  50  from  full  speed. 

L131:  Possibly  the  bleeds,  you  could  lose  it  that  way. 

E132:  Any  other  possibilities? 

L133:  Like  I  said,  the  dump  could  cause  it, 

L134:  when  you  have  problems  with  the' 35  lb  line. 

L135:  If  that  was  the  problem  you  hear  the  35  lb  make  up 
and  you  hear  the  feedpump  speed  up  a  little. 

L136:  You  definitely  hear  the  35  lb  line  making  up 

L137 :  from  the  high  pressure  steam  coming  into  the  low  pressure  steam 
L138:  where  the  low  pressure  dumping  it  into  the  condenser. 

(S31  checked  the  35  lb  dump  regulator  and  found  the  solution) 


main  construct  for  representing  disease  categories. 

An  application  of  the  frame  concept  to  fault  diagnosis  involves 
assuming  the  existence  of  a  "hypothesis  frame"  which  contains  all 
relevant  information  about  a  particular  failure  situation.  In  other 
words,  for  each  possible  failure  or  hypothesis  there  is  a  frame  associ¬ 
ated  with  it.  Each  hypothesis  frame  may  contain  four  slots:  symptom 
slot,  component  slot,  inference  slot,  and  flow  slot.  Table  6-4  shows  an 
example  of  hypothesis  frame  for  condensate  pump  failure.  Six  different 
symptoms  are  associated  with  this  failure.  These  point  to  proper  rules 
in  the  symptom  knowledge  base.  Eight  relevant  components  are  listed. 
These  point  to  modules  in  the  system  knowledge  base  where  the  relation¬ 
ships  among  the  components  are  placed  in  proper  abstraction  level.  Com¬ 
ponents  may  point  to  modules  of  higher  abstraction  levels.  For  example, 
the  atmospheric  feed  tank  may  point  to  the  cooling  system  module  (see 
Figure  6-1)  instead  of  atmospheric  tank  module  or  feedwater  system 
module.  This  scheme  facilitates  the  diagnostic  process  that  involves 
reasoning  from  higher  abstraction  levels  and  proceeding  in  a  hierarchi¬ 
cal  manner.  The  inference  slot  contains  relevant  inferences  which  may 
be  derived  from  the  system  knowledge  base.  The  flow  slot  indicates 
which  type  of  flow  is  involved. 

The  relationships  among  system  knowledge,  symptom  knowledge, 
hypothesis  frame  database,  and  known  symptoms  are  depicted  in  Figure  6- 
2.  Known  symptom  set  is  used  to  match  rules  in  the  symptom  knowledge 
base.  Rules  matched  cause  the  appropriate  frames  from  the  hypothesis 
frame  database  to  be  chosen.  New  symptoms  may  be  found  while  processing 


Table  6-4:  Hypothesis  frame  for  a  condensate  pump  failure 


Symptom  slot:  (All  are  indexed  back  to  proper  rules 
in  the  symptom  knowledge  base.) 

condensate  pressure  low 
deaerator  pressure  low 
deaerator  level  low 
vacuum  pressure  low 
condenser  level  high 
LP  turbine  exhaust  temp  high 

Component  slot:  (All  are  indexed  into  proper  modules 
in  the  system  knowledge  base.) 

condensate  recirculate  control  valve 

feedwater  make-up  control  valve 

feedwater  make-up  regulator  pressure  transducer 

atmospheric  drain  tank 

deaerator  feed  tank 

main  condenser 

condensate  pump  suction  valve 
condensate  pump  discharge  valve 

Inference  slot: 

1.  Condensate  pump  failure  will  cause  low  condensate 
discharge  pressure; 

2.  therefore  more  condensate  remains  in  the  condenser 
than  normal. 

3.  This  in  turn  causes  condenser  level  to  go  high. 

4.  Once  the  level  goes  high,  it  loses  vacuum  and  the 
LP  turbine  exhaust  temperature  increases. 

5.  low  condensate  discharge  also  causes  the  deaerator 
level  and  pressure  to  go  low. 

6.  This  in  turn  triggers  the  feedwater  makeup  control. 

7.  which  results  in  the  atmospheric  drain  tank  dumping 
water  into  the  deaerator  tank. 

Flow  slot: 


feedwater 


i-: 

Figure  6-2:  Relationships  among  known  symptom  set  symptom  i. 

knowledge  base,  hypothesis  frame,  and  system 
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through  inquiry  with  of  system  knowledge  base. 


frames.  In  this  case  the  known  symptom  set  should  be  updated.  The  sys¬ 
tem  knowledge  base  provides  the  information  about  system  structure  to 
support  the  processing  of  frames.  On  the  other  hand,  information  in  a 
frame  may  be  updated  (e.g.,  corrected  or  augmented)  by  interacting  with 
the  system  knowledge  base.  Also  new  frames  may  be  acquired  through 
years  of  diagnostic  experience.  In  other  words,  learning  may  be 
involved.  However,  the  mechanism  of  acquisition  and  modification  of 
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hypothesis  frames  is  a  complicated  learning  process.  The  learning  pro¬ 
cess  is  not  discussed  in  this  thesis.  Nevertheless  the  existence  of 
hypothesis  frames  is  assumed  and  a  model  for  the  diagnostic  processi s 
described  based  on  this  assumption. 

A  Model  for  the  Diagnostic  Process 

At  the  beginning  of  a  diagnostic  process,  there  are  some  given 
symptoms  in  the  known  symptom  set.  As  the  diagnosis  proceeds,  this 
symptom  set  is  updated.  The  updating  process  may  not  simply  be  the  addi¬ 
tion  of  new  symptoms.  Deductions  may  be  drawn  from  symptoms.  For  exam¬ 
ple,  while  solving  Failure  23  in  which  both  feedpumps  were  heard  to  make 
noises,  most  subjects  would  conclude  that  the  trouble  lay  upstream  of 
the  feedpumps.  The  trouble  could  not  be  the  feedpumps  themselves 
because  it  is  very  unlikely  that  both  pumps  would  go  down  at  the  same 
time.  This  new  deduction  about  the  system  from  the  symptoms  becomes  a 
part  of  the  known  symptom  sec. 

Symptoms  in  chis  set  are  ased  to  natch  with  the  symptom-oause 
rules.  Rules  in  the  symptom  knowledge  base  may  be  organized  into  parti¬ 
tions  of  several  different  categories,  such  as,  steam-related,  noise- 
related,  fuel-related,  etc.  There  are  two  ways  to  make  the  matching 
process  efficient.  The  first  is  the  facts  about  the  particular  failure 
under  investigation.  These  facts  are  generalized  from  the  known  symptom 
set  and  the  experience  of  evaluating  frames.  Examples  of  commonly  used 
facts  are  "trouble  should  be  upstream  of  the  feedpumps  since  both  pumps 
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have  failed",  or  "it  is  a  steam  demand  problem",  etc.  These  facts  form 
a  search  direction  that  guides  the  rule-matching  process  to  search  in 
certain  partitions  only  instead  of  the  whole  database. 

Another  way  to  speed  up  the  matching  process  is  to  use  heuris¬ 
tics.  An  example  of  a  heuristic  is:  "choose  frames  of  higher  priority". 
Rules  in  the  symptom  knowledge  base  are  prioritized.  Subjects  have  a 
tendency  to  choose  the  higher  priority  rules.  In  Table  6-3,  given 
vacuum  loss  when  slowing  down  the  ship  (Failure  19),  S31  checked  the 
gland  seal  steam  system  first,  followed  by  the  35  lb  line  system.  In 
fact,  13  out  of  19  subjects  observed  checked  gland  seal  before  they 
checked  the  35  lb  line  system.  Another  example  of  a  heuristic  is: 
"choose  tighter  rules  whenever  possible".  It  was  observed  that  if  the 
vacuum  loss  symptom  occurred  at  full  speed  (Failure  3),  a  different 
sequence  of  actions  would  be  adopted.  19  out  of  20  subjects  checked  the 
condensate  system  first  instead  of  the  gland  seal  system  which  was  the 
most  preferred  action  when  vacuum  loss  was  paired  with  speed  reduction. 

Once  an  appropriate  ruie  is  found  relevant ,  tne  sorresponding 
nypothesis  frame  is  proposed.  During  the  processing  if  a  frame,  i.e., 
evaluating  the  hypothesis,  subjects  interrogate  the  system  to  match  the 
information  in  the  slots.  Any  new  symptoms  found  are  fed  back  to  the 
known  symptom  set.  If  new  symptoms  are  found  in  the  process  of  evaluat¬ 
ing  the  hypothesis  frame,  these  new  symptoms  and  old  symptoms  may  be 
used  to  search  in  the  symptom  knowledge  base  for  tighter  or  higher 
prioritized  hypothesis  frames.  However,  if  inferences  drawn  from  frames 
are  in  conflict  with  observations  from  the  known  symptoms,  then  these 


frames  should  be  discarded. 

If  no  new  symptoms  or  rules  are  found,  subjects  may  resort  to 
another  type  of  heuristic.  Some  observed  heuristics  are  given  below. 

1.  Physical  closeness:  If  subjects  failed  to  find  any  significant 
symptom  from  the  system,  they  tended  to  search  through  subsys¬ 
tems  or  components  that  were  physically  close  to  the 
subsystem/component  that  they  currently  hypothesized  to  have 
failed.  For  example,  while  solving  Failure  7  in  which  the  HP 
turbine  vibration  excessive  alarm  sounded  and  the  engine  noise 
level  was  higher  than  normal,  S21  started  from  the  superheated 
steam  frame.  Then  he  checked  every  component  in  sequence, 
starting  from  the  burner  and  going  all  the  way  up  to  the  FD 
fan.  All  these  components  were  physically  close,  but  not 
necessarily  relevant  to  this  particular  failure.  Among  6  sub¬ 
jects  who  had  difficulty  with  this  failure,  4  had  a  tendency  to 
process  subsystems  that  were  physically  close. 

2.  Pick  a  best-matched  frame:  Investigate  frames  that  are  matched 
best.  In  Table  6-3,  after  failing  to  confirm  the  gland  seal 
and  circulating  water  frames,  the  subject  tried  to  process  the 
bleed  frame.  However,  at  first  he  was  not  convinced  that  bleed 
was  the  problem  (L128).  The  reason  was  that  there  should  have 
been  other  symptoms  preceding  the  loss  of  vacuum  (L135)  if 
bleed  was  the  problem.  Nevertheless  this  frame  was  chosen 
under  the  best-matched  heuristic. 
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3.  Direct  mapping  from  symptoms:  Investigate  frames  that  contain 
the  components  or  subsystems  indicated  in  the  symptoms.  For 
example,  8  out  of  21  subjects  picked  feedpump  failure  frame 
when  presented  with  Failure  8  in  which  feedpump  noises  are 
heard . 

Table  6-5  lists  types  of  heuristics  used  by  subjects  who  had  dif¬ 
ficulty  solving  failures  rated  as  moderate  and  difficult  in  Chapter  V. 
Figure  6-3  presents  this  model  in  a  flowchart-like  schematic. 


Table  6-5:  Types  of  heuristics  used  by  subjects.  PC  denotes 
Physical  Closeness.  BM  denotes  Best-Matched  heuristics. 
DM  denotes  Direct  Mapping  from  symptoms.  PC  and  DM 
were  identified  from  the  subjects'  recorded  actions, 
while  BM  was  identified  from  protocols  which  were  only 
conducted  on  some  subjects.  Hence  BM  occurred  less  often 
then  PC  and  DM.  Note  that  a  subject  might  use  more  than 
one  type  of  heuristic  in  a  failure.  Also  note  that  not 
every  heuristic  is  identified  in  this  analysis. 


Failure 

16 

20 

moderate  21 
26 

7 

8 

difficult  9 
10 
19 


PC  BM  DM 

1  -  7 

7  -  3 

5  -  1 

9  -  - 

4  -  - 

3  1  8 

13-2 
11-2 


no.  of  subjects 

8 

10 

6 

9 

6 

21 

14 

15 
8 


1 
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The  model  proposed  in  previous  sections  is  abstracted  from  exper¬ 
imental  data  and  protocols.  There  are  six  pieces  of  protocols  taken 
from  four  subjects  (S31,  S32,  S33,  S34).  Four  out  of  six  protocols  were 
taken  while  subjects  were  solving  Failures  8,  9,  19  and  23  respectively. 
The  other  two  recorded  subjects'  description  of  how  the  powerplant 
works,  and  the  function  of  gland  seal  steam.  Four  diagnostic  protocols 
are  interpreted  below  in  terms  of  the  model  outlined  in  the  last  sec¬ 
tion. 


Protocol  1_: 

Table  6-3  lists  an  excerpt  from  a  protocol  taken  while  S31  was 
diagnosing  Failure  19  in  which  vacuum  was  dropping  at  reduced  speed. 
S31  might  take  the  vacuum  dropping  as  a  symptom.  However,  "at  reduced 
speed"  seemed  to  provide  another  piece  of  information.  Therefore  a 
tighter  rule  that  considered  both  was  preferred.  He  matched  antecedents 
of  the  rules  in  the  symptom  knowledge  base  with  these  two  known  symp¬ 
toms.  If  he  had  the  rules  3S  in  Table  6-1  in  his  repertoire,  then  rule 
1  instead  of  rule  2  would  be  matched.  Therefore,  he  first  processed  the 
gland  seal  regulator  failure  frame  since  this  frame  had  the  highest 
priority  (L101,  L102).  Once  the  frame  was  brought  into  attention,  he 
tried  to  match  and  evaluate  the  frame  by  interrogating  the  system  3bout 
the  information  in  the  slots.  This  was  a  straig1- Li ->rward  failure  in 
which  the  gland  seal  steam  pressure  had  to  be  low.  So  S31  checked  this 
information  with  the  system.  However,  the  system  responded  that  the 
gland  seal  pressure  was  normal.  The  subject  rejected  this  frame  without 
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finding  any  new  symptoms. 

He  kept  searching  down  the  priori  v  list  and  processed  the  second 
most  probable  frame,  the  circulating  water  failure  frame  (L104,  L105;. 
He  checked  the  system  to  see  if  the  circulating  water  pressure  was  low, 
which  might  be  the  first  symptom  in  the  symptom  slot.  The  system 
responded  that  it  was  alright.  Therefore  he  processed  the  third  frame 

in  the  priority  list  -  the  condensate  pump  failure  frame  (L112,  L1131. 

This  time,  instead  of  checking  symptoms  in  the  symptom  slot,  he  followed 
the  inference  path  in  the  inference  slot  (see  Table  6-4)  to  reaci  an 
assertion  that  if  the  frame  was  appropriate  then  the  condenser  level 
must  have  been  high  (L118,  L119).  So  he  checked  it  and  found  that  it 
was  alright  again.  At  this  moment  S31  was  stuck  and  found  no  further 
symptoms  to  help  continue  the  diagnostic  process. 

After  a  little  pause,  he  seemed  to  pick  up  a  frame  that  he 
thought  was  related  but  did  not  match  the  known  symptoms  well  (L128). 
Here  we  could  assume  that  he  was  applying  the  "pick  a  best-matched 
frame"  heuristic.  By  using  the  information  provided  in  the  inference 
slot  of  the  new  frame  (the  35  lb  line  system  failure  frame),  he  was  able 
to  make  several  qualitative  inferences  about  the  system  changes  quickly 
(L135,  L136,  L137).  He  checked  the  35  lb  steam  dump  regulator  and  found 
the  solution. 

Protocol  2. : 

Another  protocol  was  taken  while  S33  was  solving  Failure  9.  It 
showed  how  heuristics  were  employed  and  how  frames  with  conflicting 


> 


1 


1 
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symptoms  were  rejected  as  well  as  how  modules  in  the  system  knowledge 
base  were  employed  to  explain  failures.  Table  6-6  presents  an  abridged 
version  of  the  protocol. 

Failure  9  described  a  situation  in  which  white  haze  was  seen. 
S33  quickly  responded  that  too  much  air  was  the  problem  (L552) .  After 
locating  the  failure  in  a  rather  high  abstraction  level  (in  this  case 
-  the  air  supply  system),  S33  got  into  a  lower  abstraction  level  rea¬ 
soning  starting  with  the  FD  fan  (L553).  At  this  moment  the  FD  fan 
failure  frame  was  brought  into  attention.  The  inference  slot  of  this 
frame  might  contain  the  statements  such  as  L559  and  L560.  These  infer¬ 
ences  led  S33  to  conclude  that  air  pressure  should  be  changed.  From 
L554  to  L558,  S33  tried  to  explain  how  the  control  of  the  FD  fan  works 
in  terms  of  modules  of  lower  abstraction  level  such  as  diaphragms,  and 
control  signals.  In  fact,  a  rather  simplified  model  of  how  the  fan 
functions  along  with  the  boiler  was  implied.  Figure  6-4  depicts  this 
model  in  a  block  diagram. 

This  type  of  knowledge  does  not  appear  in  the  hypothesis  frame 
itself.  However,  the  component  slot  of  the  frame  might  point  back  to 
modules  in  the  system  knowledge  base  in  which  this  type  of  knowledge 
appears  as  definitions,  functions  or  connections  with  other  modules. 

Having  failed  to  find  the  fault  in  the  air  supply  system,  S33 
decided  to  try  the  fuel  oil  supply  system.  He  found  that  fuel  oil  tem¬ 
perature  was  normal  and  pressure  was  high  (L575,  L376).  He  decided  to 
investigate  the  fuel  oil  system.  However,  in  reality  most  failures  in 


Table  6-6:  Protocols  from  S33  while  solving  Failure  9 


Failure  9  (White  haze  is  seen  on  STBD  boiler  periscope)  (S33) 


L550:  White  haze  means  too  much  air. 

L531:  Let  us  check  the  STBD  burners'  management  system. 

It  says  it  is  running. 

L532:  Look  at  the  air  system. 

Too  much  air  may  be  the  problem. 

L333:  So  burners'  management  sytem  is  OK. 

Let  us  start  with  the  fan  and  work  our  way  down. 

L334:  I  got  pressure  to  the  boiler  out  of  the  fan. 

L333:  Signals  go  into  the  combustion  control  of  that  fan, 
L336:  that  regulates  the  fan. 

L337:  Maybe  the  combustion  control  is  sending  a  wrong  signal 
to  the  top  of  the  diaphragm. 

L338:  Or  the  diaphragm  is  broken. 

L339:  If  it  fails  open  then  I  have  more  air  than  I  need. 
L360:  So  check  the  air  pressure. 

L361:  It  is  high. 

L362:  The  fan  is  OK  since  it  is  running. 

Maybe  the  crossover . normal. 

L363:  Go  down  the  line  see  if  I  can  pick  up  something  else. 
L564:  Check  windbox  down  the  line. 

It  says  high .  but  crossover  is  normal. 

L363:  So  nothing  is  wrong  along  this  line. 

FD  fan  is  running. 

L366:  Maybe  it  has  to  be  slow.  Well,  OK. 

L567 :  I  will  say  maybe  my  fuel  oil  has  problem. 


L373:  Let  us  check  out  the  temp  of  the  fuel. 
L376:  Temp  is  normal,  pressure  is  slightly  high. 


L386:  Check  the  strainer...  no,  that  cannot  be  it. 
L387 :  Because  that  should  cause  pressure  to  go  low. 


L392:  Something  wrong  in  the  pump?  No,  it  cannot  be  because 
it  would  be  low  too. 

L393:  If  the  relief  valve  is  open.... 

L394:  It  would  be  low  too. 


L605:  I  am  stuck. 

(The  solution  should  be  blockage  in  the  STBD  boiler  tubes) 


Figure  6-4:  A  simplified  model  of  the  air  supply 
system  which  was  implied  by  S33  in 
his  protocol. 

the  fuel  oil  system  would  show  opposite  symptoms,  i.e.,  low  instead  of 
high  fuel  oil  pressure.  "Direct  mappirg  from  symptoms"  heuristic  might 
be  employed  to  explain  why  S33  took  this  move  which  turned  out  to  be 
futile. 

S33  then  picked  up  component  failure  frames  along  the  fuel  oil 
line  and  found  that  they  all  conflicted  with  the  fact  that  fuel  oil 
pressure  was  high  (L586,  L387,  L392,  L393,  L594).  So  these  frames  were 
rejected  one  by  one.  Finally,  S33  claimed  that  he  was  stuck  and  gave  up 
(L605).  The  key  to  the  solution  lies  in  the  rules  associated  with  white 

haze.  S33  seemed  to  have  only  one  cause  to  explain  for  white  haze  - 

too  much  air.  However,  in  this  case,  a  boiler  tube  leak  was  the  cause 
of  the  white  haze.  The  high  air  pressure  and  fuel  oil  pressure  were  the 
consequences  for  compensating  for  steam  loss. 


s 


Protocol  3 : 


Table  6-7  shows  the  first  part  of  the  protocols  taken  while  S3 2 
was  solving  Failure  23.  In  failure  23  boiler  pressure  dropped  and  smoke 
was  seen  on  both  boilers  while  the  ship  was  brought  back  to  full  speed 
ahead.  At  the  same  time  low  atomizing  steam  alarm  sounded.  S32  first 
considered  only  part  of  the  known  symptoms,  i.e.,  both  pressures  drop¬ 
ping.  He  retrived  the  fuel  oil  system  failure  frame  and  found  conflicts 
of  symptoms  (L201).  He  then  rejected  this  frame  and  processed  another 
frame  -  smoke  frame.  This  time  he  successfully  explained  both  pres¬ 
sure  and  smoking  symptoms  (L203  -  L207).  At  this  moment,  low  atomizing 
steam  symptom  was  brought  into  attention.  S32  was  able  to  make  several 
quick  inferences  about  the  failure  of  the  atomizing  steam  regulator. 
The  inferences  seemed  to  match  with  the  known  symptoms. 


S32  then  proceeded  to  find  the  solution  -  atomizing  steam 

reducing  regulator  failure.  This  analysis  showed  a  pitfall  troub¬ 
leshooters  committed  often:  neglecting  known  symptoms.  If  S32  did  not 
overlook  the  low  atomizing  steam  from  the  beginning  he  need  not  have 
gone  through  the  first  two  frames.  This  seemed  to  point  out  a  possibil¬ 
ity  for  aiding.  An  appropriately  designed  fault  diagnosis  aiding  system 
should  be  able  to  help  the  troubleshooters  choose  the  tightest  rules  by 
not  overlooking  symptoms.  Rouse  (1981)  noted  similar  phenomenon.  He 
found  that  valuable  negative  information  was  overlooked  in  a  context- 
free  logical  troubleshooting  task. 
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Table  6-7:  Protocols  taken  from  S32  while  solving  Failure  23. 


Failure  23  (When  rising  back  to  full  speed,  boiler  pressure 

dropped  and  smoking  in  both  boilers.  Also  the  low 
atomizing  steam  alarm  sounded)  (S32) 


L200:  First  we  need  to  find  out  what's  causing  the  boiler  pressure  to  drop. 
L201:  Check  fuel  oil  pressure. 

You  said  it  is  OK. 

L202:  And  you  said  smoking. 

L203 :  That 1 11  probably  indicate  the  pressure  would  be  extremely  high 
L204:  because  it  is  a  large  load  from  the  boiler. 

L205:  So  that  could  cause  the  smoke. 

L206:  It  would  be  too  much  demand  for  the  boiler  to  make  up. 

L207 :  That's  the  reason  the  boiler  pressure  is  dropping. 

L208:  With  low  atomizing  steam  we  have  to  check  the  150  lb  system 
to  see  if  that  is  normal. 

L209:  If  that's  normal,  we  have  to  go  to  the  atomizing  steam  regulator 
to  see  that  is  functioning  properly. 

L210:  If  that  fails,  you  have  poor  atomization  which  undoubtably 
will  trip  the  boiler. 

E211:  Cause  the  smoke? 

L212:  Surely,  because  you  have  low  atomizing  steam, 

L213:  and  you  would  have  to  slow  down  to  bring  the  pressure  back 
so  you  don't  lose  the  plant. 

L214:  Doing  that  will  bring  up  the  fuel  oil  pressure  back. 

L215:  Remedy  the  atomizing  steam  somewhat  or  bypass  it  and  find  out  the  fault. 
That  seems  to  be  the  problem. 

(S32  found  the  solution:  atomizing  steam  reducing  regulator  failure) 
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Table  6-8:  Protocols  taken  from  S34  while  solving  Failure  8 

Failure  8  (Vibration,  noise  and  speed  fluctuation  occurred 
in  the  no.  1  pump.  When  the  engineer  started  the 
no.  2  pump,  the  same  symptoms  occurred  again.)  (S34) 

L450:  Well,  maybe  loss  of  suction  because  that  is  common  to  both  pumps. 

L451:  Check  the  level  of  the  deaerator  (normal). 

L452:  I  have  checked  out  the  suction  side  of  the  water  side  of  both  pumps. 
L456:  What  will  cause  fluctuation.  Maybe  there  is  .... 

L457:  I  will  go  to  my  steam  system. 

L458:  Now  I  have  eliminated  the  suction  water  side  of  the  pump. 

I  am  up  to  the  economizer. 

L459:  Check  temp  gauges  coming  in  and  out  of  economizer. 

L460:  I  wonder  if  there  is  a  leakage  in  the  economizer? 

L461:  Maybe  the  economizer  is  leaking. 

L462:  Sometimes  when  you  started  up  a  pump  it  will  cause  speed 
fluctuation  and  trip  out. 

L463:  Inlet  is  normal check  the  outlet. 

L464:  If  outlet  is  low,  then  that  means  the  economizer  is  plugged  up. 

L465:  Normal.  So  every  thing  on  the  water  side  is  OK. 

L466:  Now  I  am  going  to  the  steam  aide  of  the  feedpump. 

L468:  Check  if  35  lb  line  has  back  pressure.... 

L469:  Check  if  35  lb  pressure  is  normal  (normal). 

L470:  If  that  is  normal,  back  pressure  cannot  be  low. 

L471:  I  will  be  getting....  Pumps  would  not  be  fluctuating. 

L472:  Now  go  on  to  the  steam  side  of  the  inlet. 

L473:  I  cannot  see  nothing  that  is  common... 

L475:  Desuperheated  steam  for  my  feedpump.... 

L476:  Check  if  both  of  them  come  out  right. 

L477 :  I  want  to  find  a  cause  common  to  both  feedpumps. 

L478:  I  have  checked  my  pressure  and  levels. 

L479:  I  wonder  about  if  the  temp  of  the  steam  is  relevant... 

L480:  If  cold  water  goes  into  the  feedpump, 

L481 :  It  would  cause  it  to  flash  the  steam 
L482:  And  the  pumps  would  lose  suction. 

L483 :  A  lot  of  times  when  we  have  problems  with  our  deaerator 
L484:  And  we  have  to  bypass  the  deaerator. 

L485:  We  take  water  from  our  distilled  tank. 

L486:  The  water  in  the  deaerator  is  275  degrees, 

L487 :  And  the  water  coming  from  the  distilled  tank  is  90  degrees. 

L488:  Difference  in  change  will  cause  it  to  flash. 

L489:  And  the  pump  will  lose  suction. 

L490:  I  am  going  to  check  out  the  temp  of  the  water  coming  from 
the  deaerator. 

L499:  .  Steam  temp  may  be  a  problem. 

L500:  20  degrees  low  coming  out  of  the  desuperheater. 

L501:  Temp  of  steam  could  cause  fluctuation  of  the  feedpump 
L502:  Because  the  higher  the  temp  the  more  the  work  you  get  out  of  the  steam 
L503:  So  if  it  is  20  degrees  low,  it  got  to  be  something  wrong 
in  my  desuperheater. 

L504:  I  am  now  backtracking....  coming  out  of  the  superheater 
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L505:  Going  into  the  desuperheater. 

L506:  Backtrack  a  little  bit  further. 

L507:  Come  out  of  the  drum  going  to  my  superheater. 

L508:  I  am  going  to  check  my  drum  level  (normal). 

L509:  So,  back  into  my  superheater. 

L510:  It  has  to  be  the  attemperator ,  the  desuperheater  or  the  superheater. 
(At  this  point  S33  found  the  solution  -  desuperheater  failure) 


Protocol  4.: 

Table  6-8  is  an  abridged  version  of  the  protocols  taken  while  S34 
was  solving  Failure  8.  In  Failure  8,  vibration,  noise  and  speed  fluc¬ 
tuation  occurred  in  the  no.  1  feedpump.  When  the  engineer  started  the 
no.  2  pump,  the  same  symptoms  occurred  immediately.  S34  seemed  to  be 
directed  by  the  heuristics  of  "direct  mapping  from  symptoms".  The  frame 
of  feedpump  failure  was  investigated  first  (L452).  The  known  symptoms 
seemed  to  be  augmented  by  an  observation  deduced  from  given  symptoms, 
i.e.,  the  fault  must  be  common  to  both  pumps  (L450).  Having  failed  to 
find  any  significant  evidence  to  support  this  frame,  S34  proceeded  to 
the  economizer  failure  frame  (L458  -  L466).  This  transition  of  frame 
involves  no  new  symptoms.  S34  did  not  seem  to  have  any  rule  that  could 
cover  the  symptoms  (L456).  Transition  from  feedpump  failure  frame  to 
economizer  failure  frame  may  be  explained  by  heuristics  such  as  "physi¬ 
cal  closeness"  or  "best-matched  frame”.  The  economizer  frame  seemed  to 
support  the  fluctuation  symptoms.  Several  inferences  were  drawn  from 
this  frame  to  come  up  with  a  scheme  for  evaluation,  i.e.,  test  to  see  ii 
<  ’f.erences  existed  between  inlet  and  outlet  pressures.  Having  failed 
to  find  evidence  to  support  the  economizer  frame  (L465),  S34  considered 
the  35  lb  line  system  failure  frame  (L466  -  L471).  However,  the  infer¬ 
ence  S34  drawn  from  this  frame  was  in  conflict  with  the  known  symptoms 


fluctuation  (L471).  So,  this  frame  was  rejected 


Then  S34  went 


back  to  the  feedpump  failure  frame  again  (L475  -  L482) .  This  time  he 
investigated  the  possibility  of  cold  water  going  into  the  feedpump.  He 
found  that  it  could  cause  loss  of  suction.  This  inference  resulted  in 
consideration  of  the  deaerator  failure  frame  for  processing  only  to  find 
that  it  was  normal  (L483  -  L490). 

Although  the  last  hypothesis  frame  failed,  the  possibility  of 
difference  of  temperature  seemed  to  lead  S34  to  have  the  idea  of  steam 
temperature  difference  (L499).  A  new  symptom  was  then  found,  i.e.,  tem¬ 
perature  was  20  degrees  lower  than  normal  coming  out  of  the  desu¬ 
perheater  (L500).  The  desuperheater  failure  frame  was  then  brought  in 
for  processing  (L501  -  L510).  Finally,  S34  found  the  problem  in  the 
desuperheater. 

These  four  protocols  show  how  diagnostic  process  proceeds  from 
symptoms  to  the  faulty  components.  The  model  seemed  to  interpret  the 
protocols  reasonably  well.  The  following  discusser  how  the  attributes 
of  the  model  fit  the  protocols,  and  compares  the  model  with  other 
approaches. 

Discussions 

This  model  is  characterized  by  three  key  aspects:  heuristics, 
knowledge  bases,  and  diagnostic  processes.  Heuristics  are  employed  when 
no  obvious  frames  can  be  found.  Five  heuristics  were  identified  for 
this  model.  They  are:  physical  closeness,  best-matched,  direct-mapping, 
tighter  rule,  and  higher  priority  heuristics. 


Knowledge  bases  contain  all  relevant  domain  knowledge.  They  are: 
the  system  knowledge,  symptom  knowledge,  and  hypothesis  frame  bases. 
System  knowledge  base  contains  modules  of  different  abstraction  levels, 
and  is  organized  hierarchically.  The  definition,  function,  and  connec¬ 
tions  of  components  may  be  found  in  the  modules.  The  dynamics  of  a  com¬ 
ponent  is  described  in  qualitative  terms  and  represented  in  a  module. 
This  has  the  potential  to  allow  the  model  to  "reason  from  the  first 
principles"  which  has  been  advocated  by  Davis  (1983).  The  symptom 
knowledge  base  is  in  some  sense  an  index  file  for  the  hypothesis  frames. 
The  keys  are  the  matching  symptoms.  The  symptom-cause  pairs  associate 
matching  symptoms  with  appropriate  hypothesis  frames. 

The  hypothesis  frame  base  contains  frames  for  individual  failures 
(see  Table  6-4).  This  is  similar  to  the  "compiled"  deep  knowledge  con¬ 
cept  of  Chandra sekar an  and  Mittal  (1983).  In  their  proposal,  the 
knowledge  needed  for  diagnostic  reasoning  is  "compiled"  into  the  system 
knowledge  structure.  This  allows  reasoning  to  proceed  efficiently 
without  the  need  to  access  deep  knowledge  such  as  those  of  the  "first 
principles".  However,  "compiled"  deep  knowledge  scheme  limits  the 
access  of  knowledge  to  certain  paths  that  are  reachable  under  the 
knowledge  organization.  The  information  in  hypothesis  frames  can  be 
used  independent  of  reasoning  paths  since  they  are  independent  of  the 
knowledge  structure.  Therefore  they  can  be  elicited  from  any  point  of 
the  reasoning  process.  This  database  is  highly  modularized.  Modifica¬ 
tion  and  addition  of  frames  can  be  accomplished  with  minimal  effort. 

The  inferences  provided  in  the  inference  slot  in  a  hypothesis 
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frame  make  Che  forward  reasoning  possible.  Using  information  contained 
in  the  inference  slot,  the  model  can  infer  about  the  system  state  as  a 
result  of  the  fault  such  as  "boiler  pressure  drop"  etc.  This  enables 
the  model  to  generate  schemes  for  testing  the  frames  of  interest  against 
the  system  state. 

There  are  four  types  of  processes  involved  in  the  model.  They 
are:  symptom  updating,  frame  elicitation,  frame  evaluation,  and 
hierarchical  reasoning.  These  processes  are  widely  recognized  and 
implemented.  Geneseretb  (1982)  advocated  a  computer  hardware  fault 
diagnosis  system  using  hierarchical  design  models.  The  program,  called 
DART,  first  diagnoses  the  system  at  a  high  level  of  abstraction  to 
determine  the  major  subsystem  in  which  the  fault  may  lie.  It  then 
focuses  on  the  next  lower  level  and  repeats  until  the  fault  is  found. 

Frame  elicitation  and  frame  evaluation  are  conceptually  similar 
to  hypothesis  formation  and  hypothesis  evaluation.  Hypothesis  formation 
and  evaluation  has  been  widely  used  in  medical  diagnosis  system  (Short- 
liffe  1976,  Pople  1977). 

The  model  proposed  in  this  thesis  integrates  these  attributes  of 
diagnosis  into  a  coherent  structure.  Protocol  analysis  in  the  previous 
section  demonstrated  how  this  model  can  be  used  to  explain  experts' 
fault  diagnosis  behavior.  Table  6-9  shows  the  number  of  times  each 
attribute  of  the  model  occurred  in  the  protocols.  Judging  from  the  fre¬ 
quency  of  occurrence  and  coverage  of  of  the  attributes,  the  model  seemed 
to  fit  the  protocols  reasonably  well. 
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This  model  incorporates  the  knowledge  structure  with  processing 
schemes.  Hence  it  has  the  potential  to  be  expanded  to  implement  an 
intelligent  on-line  coach  for  troubleshooting. 

Another  possible  use  of  the  model  is  to  develop  aiding  systems 
for  fault  diagnosis.  Slovic  et  al.  (1977)  pointed  out  that  the  effec¬ 
tiveness  of  aiding  depends  upon  the  ability  of  the  user  to  decompose  the 
overall  decision  problem  into  its  constituent  elements.  This  model 
shows  the  mechanisms  underlying  the  diagnostic  actions.  The  diagnostic 
behavior,  if  viewed  as  a  decision  task,  can  be  decomposed  into  sub¬ 
processes.  Hence,  better  aiding  schemes  could  be  developed  by  consider¬ 
ing  subprocesses  and  relationships  between  subprocesses. 

A  robust  theory  of  fault  diagnosis  should  consider  the  influence 
of  detection  and  compensation  in  addition  to  diagnostic  performance. 
The  model  proposed  above  is  not  a  robust  theory  of  fault  diagnosis  in 
this  sense.  However,  it  serves  to  organize  the  observations  from  sub¬ 
jects'  behavior  in  real  diagnosis  tasks.  Further  research  is  needed  to 
make  this  model  applicable. 


Table  6-9:  Occurrence  of  the  attributes  in  the  protocols. 

Each  entry  indicates  the  number  of  times  a 
particular  attribute  of  the  model  is  observed 
in  a  protocol. 


Protocols 

Attributes 

1 

2 

3 

4 

- - * 

Physical  closeness 

1 

Best-matched 

1 

1 

Heuristics 

Direct-mapping 

1 

Tighter  rule 

1 

Higher  priority 

1 

1 

1 

1 

Symptom 

1 

1 

1 

1 

System 

1 

1 

Knowledge 

Hypothesis  Frame 

Base 

Symptom 

2 

1 

Component 

1 

Inference 

2 

1 

3 

Flow 

1 

Symptom  updating 

1 

2 

Frame  elicitation 

4 

4 

2 

7 

Processes 

Frame  evaluation 

4 

4 

2 

7 

Hierarchical 

reasoning 


Conclusions 

A  model  for  the  fault  diagnosis  process  has  been  presented  based 
on  the  data  from  experiments  and  protocols.  A  conceptual  entity  called 
hypothesis  frame  was  employed  to  account  for  the  observations  from  data 
and  protocols.  Instead  of  reasoning  from  the  dynamics  and  structure  of 
the  system,  relevant  information  about  a  failure  is  "compiled"  in  the 
frames.  Therefore  subjects  could  make  quick  and  reliable  inferences 
which  would  otherwise  be  impossible.  Given  symptoms  of  a  failure,  sub¬ 
jects  would  first  try  to  match  a  frame  from  their  repertoire  of  rules. 
Heuristics  might  be  employed  to  make  this  matching  process  efficient. 
Once  a  frame  was  identified,  subjects  could  use  the  information  con¬ 
tained  in  the  slots  to  make  inferences  or  conduct  evaluations.  If  more 
symptoms  were  found,  subjects  might  use  the  new  evidence  to  search  for 
better  hypotheses.  If  inferences  drawn  from  frames  are  in  conflict  with 
known  symptoms,  these  frames  are  discarded.  When  no  obvious  frames 
could  be  processed,  subjects  could  search  in  the  system  or  symptom 
knowledge  base  for  relevant  information  under  the  guidance  of  heuris¬ 
tics. 


This  model  was  applied  to  Failure  19  (S31),  Failure  9  (S33) 
Failure  8  (S32)  and  Failure  23  (S34)  with  good  results.  This  model  may 
be  regarded  as  reasonable  for  organizing  the  experimental  observations 
and  protocols  from  fault  diagnosis  performance.  Further  refinements  and 
testing  are  needed  before  it  can  be  applied  in  training  and/or  the 
design  of  on-line  coach  system  for  fault  diagnosis. 
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CHAPTER  VII 


CONCLUSIONS 


In  this  thesis,  ve  have  described  an  investigation  of  fault  diag¬ 
nosis  behavior  of  experts  in  a  realistic  environment  where  marine  power- 
plant  simulators  were  used.  The  initial  feasible  set  of  possible  faulty 
components  and  the  strategy  subjects  used  to  alternate  between 
hypothesis  formation  and  evaluation  stages  were  found  to  affect  the 
efficiency  of  performance.  This  finding  seems  to  imply  that  both 
context-specif ic  (initial  feasible  set)  and  context-free  factors  (tran¬ 
sition  strategy)  are  important  for  fault  diagnosis  training. 


A  micro  model  of  fault  diagnosis  that  incorporates  system 
knowledge,  symptom  knowledge  and  hypothesis  frames  into  a  coherent 
structure  was  proposed  and  used  to  interpret  protocols.  The  model  fit 
the  protocols  reasonably  well.  This  model  may  be  expanded  to  implement 
an  intelligent  on-line  coach  for  troubleshooting. 


The  research  discussed  in  this  thesis  used  a  low  fidelity  simula¬ 
tor.  A  possible  extension  of  this  research  is  to  study  fault  diagnosis 
behavior  using  moderate  and  high  fidelity  simulators  as  described  in 
Chapter  III.  Both  the  moderate  and  high  fidelity  simulators  have  the 
system  dynamics  incorporated,  which  enables  the  presentation  of  a  pro¬ 
cess  view  of  the  system  instead  of  a  frozen  picture.  This  may  result  in 
the  trainees  taking  compensatory  actions  before  diagnosis  is  completed. 
Study  of  how  these  actions  interact  with  diagnosis  and  how  the  model 
should  be  modified  to  incorporate  the  compensatory  actions  are  potential 
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research  problems 


Comparison  of  performance  between  naive  and  expert  subjects  will 
be  an  interesting  extension  to  this  research.  This  comparison  can  help 
understand  the  experts'  strategy  better  which  is  important  for  building 
intelligent  aiding  systems.  Also  the  differences  in  strategies  used  may 
indicate  how  people  become  experts. 

As  pointed  out  in  Chapter  VI,  this  research  considered  only  the 
empirical  constraints  on  modeling  the  experts 1  fault  diagnosis  perfor¬ 
mance.  To  make  this  model  implementable  as  an  intelligent  diagnostic 
aiding  system,  much  work  needs  to  be  done  to  explore  the  computational 
constraints . 

Finally,  research  using  expert  subjects  on  realistic  environments 
should  be  continued.  A  major  difficulty  lies  in  the  gathering  and 
analysis  of  protocols.  Understanding  the  strategies  used  is  difficult. 
More  efficient  methods  should  be  developed  to  extract  useful  and  sys¬ 
tematic  knowledge  from  the  protocols  or  data  automatically.  Some  prel¬ 
iminary  research  has  been  done  by  Waterman  and  Newell  (1975),  Bauer 
(1975),  and  Ericsson  and  Simon  (1984).  However,  an  automatic  protocol 
analyzer  has  not  been  developed. 
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List  of  Failures  Used  in  the  Pilot  Experiment 

Failure  No.  Faulty  Component  Symptoms 

1  STBD  Economizer  STBD  boiler  high  water 

alarm  sounds.  STBD  boiler 
drum  pressure  decreases. 

2  Condensate  Pump  Main  vacuum  goes  down  as 

plant  is  brought  to  sea 
speed.  Main  vacuum  alarm 
sounds . 

3  FO  Htr  Stm  Regulator  Smoke  alarms  &  low  fuel 

oil  temp  alarms  sound. 

Black  smoke  is  evident. 

4  PORT  FO  Settling  Tank  FO  Header  Pressure  Low 

alarm  sounds.  STBD  &  PORT 
FO  pressure  low  alarms 
sound . 

5  PORT  FO  Settling  Tank  All  flame  failure  alarms 

sound,  low  fuel  oil  pressure 
alarm  sounds.  All  fires  are 
ext inguished. 

6  Gland  Seal  Regulator  Low  vacuum  alarm  sounds  on 

main  condenser.  Vacuum  stea¬ 
dily  dropping  on  slowing  to 
half  ahead  during  maneuvering 

7  Deaerating  Feed  Tank  High  oxygen  level  in  water 

in  both  boilers. 

8  Supply  Regulator  PORT  boiler  starts  panting. 

PORT  smoke  alarm  sounds. 

9  Throttle  Oil  Booster  Main  engine  RPM  marginally 

Pump  above  preset  limit. 

10  PORT  Attemporator  Ctrl  High  superheated  steam  temp 

Valve  from  PORT  boiler. 

11  Condensate  Pump  Sue.  Main  condenser  low  vacuum 

Valve  alarm  sounds.  Main  vacuum 

is  steadily  dropping. 
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FO  Heater  Steam  Traps  Smoke  alarms  go  off  on  both 


boilers 


Sea  Strainer  Condenser  vacuum  low  alarm 

sounds . 

STBD  Air  Register  1  Repeated  flarebacks  on 

lighting  off  STBD  boiler. 


Fuel  Oil  Heaters 


Control  Air  Dryer 


Low  fuel  oil  temp  alarm  and 
smoke  alarms  on  both  boilers 
sound  upon  increasing  speed. 

Losing  main  vacuum  on 
slowing  to  half  ahead  from 

full  ahead.  Low  vacuum  alarm  ! 

sounds . 


FO  Pump  Sue.  Strainers 


STBD  Boiler  Drum  Dry 
Pipe 


Unable  to  maintain  steam 
pressure  &  temp  at  increased 
speed. 

Low  rumbling  noises  from 
HP  turbine  at  high  RPM. 


Appendix  B 

List  of  Failures  Used  in  Sessions  1  to  5 


*******  Failure  1: 

Initial  Condition:  The  ship  is  under  way  at  full  speed  (90  RPH) . 

Symptom:  Condensate  pressure  low  alarm  sounds. 

Failed  Component:  No.  100 - Condensate  pump 

(broken  flexible  coupling) 


Gauges  Affected: 

36— low 

19 - slightly  dropping 

27 - low 

58— low 

55 - high 

60 - high 

Components  Affected: 

100 - amperage  low 

104  - in  opening  position 

105  - closed 

78  - condenser  level  high  and  overboard  temp  slightly  high 

79  - hotwell  full 


*******  Failure  2: 

Initial  Condition:  The  ship  is  under  way  at  slow  speed  (20  RPM) 

Symptom:  When  speeding  up  to  full  speed  (70  rpm) , 
the  No.  2  boiler  level  drops  low. 

Failed  Component:  No.  139 - No.  2  feedwater  regulator  valve 

(stuck  open) 


Gauges  Affected: 

58 - slightly  raise 

57— low 

56 - level  low 

50 - slightly  high 

69 - 15  psi 

71 - 15  psi 

73 - 7  psi 

75 - 6  psi 

77 - 5  psi 

79 - 10  psi 

81 - 3  psi 
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83 - 3  psi 

85 - 5  psi 

87 - 5  psi 

89 - 9  psi 

91 - 5  psi 

93 - 9  psig 

Components  Affected: 

104  - closed 

105  - open 

113 - slightly  open 

139 - open  10Z 

Indicators  Affected: 

188,  190,  192  -  all  extractions  closed 

256  — —  scoop  closed 
259,  231,  233,  235 


*******  Failure  3: 

Initial  Condition:  The  ship  is  undervay  at  full  speed  (90  RPM) . 

Symptom:  Main  condenser  vacuum  low  alarm  sounds. 

Failed  Component:  No.  84 - Sea  scoop  (dirty) 

Gauges  Affected: 

27 - low 

35 - low 

55 - high 

94— disc,  pressure  low 

96— disc.  temp,  higher  than  normal 

Components  Affected: 

78 - Condenser  shell  is  warm 


*******  Failure  4: 

Initial  Condition:  The  ship  is  undervay  at  reduced  speed  (50  RPM). 

Symptom:  Main  condenser  vacuum  low  alarm  sounds. 

Failed  Component: 

No.  87 - Sea  strainer 

Gauges  Affected: 

27— low 

35 - low 

55 - high 

/  A  1C  » 
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94 - pressure  low 

96 - temp  higher  than  normal 

Components  Affected: 

78 - Condenser  shell  is  warm  and  condenser  level  is  low. 

84 - scoop  is  closed. 

105 - open  50% 

Indicators  Affected: 

188,  190,  192  -  all  extractions  are  closed 

256  -  scoop  closed 

259  -  low  sea  suction  open 

231,  233,  235  -  No.  1  cooling  water  circulator  on 

also  the  discharge  and  suction  valves  open 


*******  Failure  5 : 

Initial  Condition:  The  ship  is  underway  at  full  speed  (90  RPM) . 

Symptom:  Rumbling  noise  is  heard  from  the  boilers.  The  No  1  boiler 
starts  to  smoke  and  trips.  Rumbling  stops. 

Failed  Component: 

No.  1  -  No.  1  forced  draft  fan 

Gauges  Affected: 

1  - 0  psig 

2  - 0  psig 

3  - low 

4  - low 

5  - low 

62 - black  smoke 

Components  Affected: 

1— amperage  is  low  and  motor  is  running 

33  - closed 

34  - closed 

31 - closed 
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Indicators  Affected: 
138,  139 


*******  Failure  6: 

Initial  Condition:  The  ship  is  underway  at  full  speed  (90  RPM). 

Symptom:  High  drum  water  level  in  no.  2  boiler  sounds.  The  no.  2 
boiler  water  level  continues  to  rise. 

Failed  Component: 

No.  139  -  No.  2  feedwater  regulator  (broken  diaphragm) 

Gauges  Affected: 

58 - slightly  low 

57 - high 

93 - 0  psig 

Component  Affected: 

56  -  Gauge  glass  reads  high 

104  -  open  position 

139  -  1002  open 


*******  Failure  7 : 

Initial  Condition:  The  ship  is  underway  at  full  speed  (90  RPM). 

Symptom:  High  pressure  turbine  vibration  excessive  alarm  sounds 
and  the  engine  noise  level  is  higher. 

Failed  Component: 

No.  52  -  No.  2  internal  attemperator 

Gauges  affected: 

52 - 20  degree  low 

Components  Affected: 

76 -  noise  abnormal 


*******  Failure  8: 

Initial  Condition:  The  ship  is  underway  at  full  speed  (90  RPM). 

Symptom:  Vibration  and  noise  and  speed  fluctuation  occur  in  the 
no.  1  feedpump  (on  line).  When  the  engineer  starts  the 
no.  2  pump,  the  same  symptoms  occurs  immediately. 

Failed  Component: 

No.  54  -  no.  1  internal  desuperheater 
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Gauge 8  Affected: 

51 - 20  degrees  low 

Components  Affected: 

125 - noisy 


*******  Failure  9: 

Initial  Condition:  The  ship  is  under  way  at  full  speed  (90  RPM) . 

Symptom:  There  is  a  white  haze  in  the  no.  1  boiler  periscope. 

Failed  Component: 

No.  207  -  no.  1  boiler  tubes 

Gauges  Affected: 

1  - high 

2  - high 

3  - high 

4  - rhigh 

5  - high 

6  - slightly  high 

7  - slightly  low 

19 - 8 lightly  low 

62 - white  smoke 

44 - slightly  low 

64 - high 

77 - 8  psig 

89 - 4  psig 


*******  Failure  10: 

Initial  Condition:  The  ship  is  underway  at  full  speed  (90  RPM). 

Symptom:  No.  2  boiler  trips.  White  smoke  alarm,  rumbling  noise 
from  boiler  2.  FO  pressure  low  alarm  sounds. 

Failed  Component: 

No.  30  -  FO  Control  Valve 

Gauges  Affected: 

10—  — higher  than  normal 

11—  higher  than  normal 

12—  higher  than  normal 

13 - higher  than  normal 

14— higher  than  normal 
15 - very  low 

16— low 


52 - slightly  high 

63 - white  smoke 

65 - low 


*******  Failure  11: 

Initial  Condition:  The  ship  is  underway  at  full  speed  (90  RPM) 
Symptom:  High  superheated  steam  temperature  in  no.  2  boiler. 
Failed  Component: 

No.  48  -  no.  2  attemper ator  control  valve 

Gauges  Affected: 

52 - high 


Components  Affected: 
48— -closed 


*******  Failure  12: 

Initial  Condition:  The  ship  is  underway  at  full  speed  (90  RPM) 

Symptom:  Fuel  oil  pressure  low  alarm  sounds  for  both  boilers. 

Fuel  oil  pressure  is  erratic.  White  and  black  smoke 
appear  alternatively. 

Failed  Component: 

No.  19  -  Fuel  oil  settling  tank  (water) 

Gauges  Affected: 

6 - erratic 

15 - erratic 

39 - erratic 

62  - alternate  between  white  and  black  smoke 

63  - alternate  between  white  and  black  smoke 

64— flow  level  fluctuating 

65 - flow  level  fluctuating 

Components  Affected: 

21 - noisy  and  discharge  pressure  erratic 


*******  Failure  13 : 

Initial  Condition:  The  ship  is  underway  at  full  speed  (90  RPM) 

Symptom:  Low  vacuum  alarm  sounds  and  vacuum  is  dropping  when 
slowing  down  the  ship. 


Failed  Component: 

NO.  168  -  Gland  seal  regulator 


Gauges  Affected: 

27 - low 

55 - high 

31 - low 

Components  Affected: 

168 - low  pressure  output 


*******  Failure  14: 

Initial  Condition:  The  ship  is  underway  at  full  speed  (90  RPM) . 

Symptom:  FO  temp  low  alarms  sound  on  both  boilers.  Smoke  alarms 
sound.  Boilers  indicating  black  smoke. 

Failed  Component: 

No.  26  -  fuel  oil  heater  steam  regulator 

Gauges  Affected: 

41 - low 

47 - low 

62  - black  smoke 

63  - black  smoke 


*******  Failure  15: 

Initial  Condition:  The  ship  is  underway  at  full  speed  (90  RPM). 

Symptom:  When  the  bridge  requests  a  speed  reduction  to  70  RPM, 
the  no.  1  boiler  periscope  becomes  hazy. 

Failed  Component: 

No.  3  -  no.  1  boiler  air  damper  (stuck  open) 

Gauges  Affected: 

1  - high 

2  - high 

3  - high 

4  - high 

5  - high 

46 - increasing 

62—hazy (white  smoke) 

64 - higher  than  normal 

Components  Affected: 

3 - open  excessively 


*******  Failure  16: 


Initial  Condition:  The  ship  is  underway  at  full  speed  (90  RPM). 
Symptom:  Both  boilers  are  smoking. 

Failed  Component: 

No.  24  -  fuel  oil  heater  steam  trap 

Gauges  Affected: 

41 - low 

47 - low 

62  - dark  smoke 

63  - dark  smoke 

Components  Affected: 

26 - fully  open 


*******  Failure  17: 

Initial  Condition:  The  ship  is  underway  at  full  speed  (90  RPM). 

Symptom:  Steady  loss  of  water  in  no.  2  boiler. 

Boiler  trips  on  loss  of  all  flame. 

Failed  Component: 

No.  208  -  no.  boiler  tubes 

Gauges  Affected: 

16  - low 

17  - low 

18  - low 

50 - lower  than  normal 

52 - slightly  low 

57  - low 

58  - low 

Components  Affected: 

56- - gauge  glass  level  dropping 

104 - open 

139 - fully  open 


*******  Failure  19: 

Initial  Condition:  The  ship  is  underway  at  reduced  speed  (50  RPM) 
Symptom:  A  drop  in  vacuum  occurs  after  speed  was  reduced. 
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Failed  Component: 

No.  157  - -  35  system  dump  regulator 

Gauges  Affected: 

27 - low 

23 - lower  than  normal 

55 - higher  than  normal 

Components  Affected: 

78 - overboard  temperature  slightly  high 

89 - fully  open 

1 57 — -open  40  % 

Indicators  Affected: 

188,  190,  192  -  all  extractions  closed 

256  -  scoop  closed 

231,  233,  235  -  1  circulator  is  running  with  valves  open 


*******  Failure  20: 

Initial  Condition:  The  ship  just  came  out  of  shipyard  and 

is  underway  at  full  speed  (90  RPM). 

Symptom:  Ocassionally  experiences  sudden  drop  and  rise  of 
superheater  outlet  pressure  and  a  sudden  drop 
and  rise  of  turbine  speed. 

Failed  Component: 

No.  49  -  1  superheater  (blockage) 

Gauges  Affected: 

8 - suddenly  rise  and  drop 

1  - suddenly  rise  and  drop 

2  - suddenly  rise  and  drop 

3  - suddenly  rise  and  drop 

4  - suddenly  rise  and  drop 

6 - suddenly  rise  and  drop 

7— suddenly  rise  and  drop 
9— suddenly  rise  and  drop 
46 - suddenly  rise  and  drop 


*******  Failure  21: 

Initial  Condition:  The  ship  is  underway  st  reduced  speed  (50  RPM). 

Symptom:  The  command  is  given  to  go  up  to  90  RPM.  The  ship  goes  to 
90  RPM.  When  tank  cleaning  is  started  the  boiler  will  not 
maintain  pressure  without  excessive  black  smoke  in  PORT 
boiler. 


Failed  Component: 

NO.  11  -  2  air  heater  (dirty) 

Gauges  Affected: 

10  - high 

11  - high 

12  - high 

13  - high 

14  - low 

48 - slightly  low 

63— dark  smoke 


*******  Failure  22: 

Initial  Condition:  The  ship  is  underway  at  full  speed  (90  RPM) . 

Symptom:  When  testing  the  boiler  water,  1  boiler  has  an  abnormal 
drop  in  alkalinity,  sulfite  and  phosphates  with  no  change 
in  chloride  content.  In  the  2  boiler  the  chemical 
concentration  increased. 

Failed  Component: 

No.  53  -  internal  desuperheater  (small  leak) 

Gauges  Affected: 

45 - slightly  low 


*******  Failure  23: 

Initial  Condition:  The  ship  is  underway  at  reduced  speed  (70  RPM). 

Symptom:  When  raising  to  normal  full  speed,  boiler  pressures  drop 
and  smoking  in  both  boilers.  Low  atomizing  steam  alarms 
sound . 

Failed  Component: 

No  90  -  atomizing  steam  reducing  station 

Gauges  Affected: 

6  - fluctuating 

15  - fluctuating 

7  - low 

8  - low 

9  - low 

16  - low 

17  - low 

18  - low 

28— low 

40 - low 

62— dark  smoke 


63 - dark  smoke 

69 - 15  psi 

71 - 15  psi 

73 - 8  psi 

75 - 7  psi 

77 - 5  psi 

79 - 7  psi 

81 - 3  psi 

83 - 3  psi 

85 - 8  psi 

87 - 7  psi 

89  - 9  psi 

91 - 7  psi 

93 - 9  psi 

Components  Affected: 

90  - open  100  Z 

Indicators  Affected: 

188,  190,  192 - all  extractions  are  closed 

256— scoop  closed 

259 - low  sea  suction  open 

231,  233,  235 - No.  1  cooling  water  circulator  on  also 

the  disc,  and  suction  valves  open. 


*******  Failure  24: 

Initial  Condition:  The  ship  is  underway  at  full  speed  (90  RPM). 

Symptom:  Condensate  pump  discharge  pressure  lower  than  normal. 
Low  Condensate  P.  alarm  sounds. 

Failed  Component: 

No.  100  -  condensate  pump 

(pump  is  running  and  has  an  internal  failure) 

Gauges  Affected: 

19 - slightly  low 

27 - slightly  low 

36 - low 

58 - low  and  falling 

60 - high 

Components  Affected: 

79 - high  level  and  rising 

100 - low  amperage 

104 - open 


*******  Failure  25: 


Initial  Condition:  The  ship  is  in  port  at  dockside 

Symptom:  The  low  control  air  pressure  alarm  sounds  on  the  main  console. 
Failed  Component: 

No.  175  -  control  air  compressor 

Gauges  Affected: 

38 - low  and  falling 


Components  Affected: 

175 - electric  motor  running  and  broken  drive  belt 


*******  Failure  26: 

Initial  Condition:  The  ship  is  underway  at  full  speed  (90  RPM) . 

Symptom:  Deaerating  feed  tank  level  is  marginally  low 
and  slowly  decreasing. 

Failed  Component: 

No.  116  -  Atmospheric  drain  tank  level  transmitter 

Gauge 8  Affected: 

58 - marginally  low 

61 - very  high 

Components  Affected: 

104 - slightly  open 

115 - tank  is  overflowing 


*******  Failure  27: 

Initial  Condition:  The  ship  has  just  received  departure  order 

and  is  steaming  at  full  speed  (90  RPM). 

Symptom:  Marginal  loss  of  main  condenser  vacuum. 

Failed  Component: 

No.  105  -  Condensate  recirculation  regulator 

Gauge 8  Affected: 

27 - marginally  low 

58 - 8 lightly  low 

60 - marginally  high 


Components  Affected: 

79 - high  level 

105 - open  80  X 
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*******  Failure  29: 

Initial  Condition:  The  ship  is  underway  at  full  speed  (90  RPM). 
Symptom:  Marginally  low  saltwater  service  system  pressure. 
Failed  Component: 

No.  171  -  Saltwater  service  pump  strainer 

Gauges  Affected: 

34 - marginally  low 

53 - marginally  high 

Components  Affected: 

72 - marginally  high  lube  oil  temperature 


Appendix  C 
Protocols 


Failure  19  (Vacuum  drop  at  reduced  speed)  (S31) 

L100:  First  indication  is  a  lot  o£  vacuum  loss. 

L101:  Lo,  first  thing  I'll  check  will  be  the  gland  seal 
L102:  (to  see)  if  it  has  pressure  after  regulator. 

L103:  You  know  on  the  board  it  is  at  least,  say,  1  or  2  pounds. 

L104:  If  that  is  OK,  I  believe,  you  check  circulator  maybe, 

L105:  to  see  that  you  have  enough  water  going  back  into  the  condenser 
L106:  to  keep  the  air  ejector  cool. 

E107 :  Why  the  air  ejector? 

L108:  Oh!  If  you  talk  about  the  vacuum  pump...  I  am  not  quite  familiar 
with  the  vacuum  pump.  Let  us  say  the  .... 

L109:  I  know  with  the  air  ejector  you  lose  vacuum  that  way. 

L110:  What  else  can  be  wrong? 

Elll:  Lo,  those  are  most  common  problems? 

L112:  Yes,  I  would  think  so,  unless  you... 

L113:  Assuming  the  condensate  pump  running  normally, 

L114:  that  will  be  the  first  indication,  the  major  cause. 

E115:  Lo,  what  else? 

L116:  Check  level  of  condenser. 

E117:  What  do  you  want  to  find? 

L118:  You  want  to  find  out  if  it  is  normal, 

L119:  if  it  is  too  high  then...  you  see  it  could  be  getting  water  from 
somewhere  else. 

El 20:  What  do  you  mean  by  somewhere  else? 

L121:  Say,  for  example,  condensate  pump  may  not  be  functioning  properly. 
El 22:  Think  aloud. 

L123:  It  would  be  ...,  well.  If  the  level  is  OK,  then  you  look  for 
something  else. 

L124:  We  just  slow  down. 

L125:  You  assume  everything  else  is  OK. 

L126:  In  other  words,  circulating  water  is  fine, 

L127:  no  reason  for  that,  which  you  think  probably  will  be  ... 

L128:  I  don't  think  it  has  any  problem  with  the  bleeds 
L129:  I  don't  know.  The  symptom  is... 

E130:  Vacuum  dropping  when  slowing  down  to  50  from  full  speed. 

L131:  Possibly  the  bleeds,  you  could  lose  it  that  way. 

E132:  Any  other  possibilities? 

L133:  Like  I  said,  the  dump  could  cause  it, 

L134:  when  you  have  problems  with  the  35  lb  line. 

L135:  If  that  was  the  problem  you  hear  the  35  lb  make  up  and  you  hear 
the  feedpump  speed  up  a  little. 

L136:  You  definitely  hear  the  35  lb  line  making  up 

L137 :  from  the  high  pressure  steam  coming  into  the  low  pressure  steam 
L138:  where  the  low  pressure  dumping  it  into  the  condenser. 
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L139:  So,  one  failure  would  cause  the  reaction  on  the  make  up  and  you 
hear  that 

L140:  you  could  remedy  that  problem  right  away. 

E141:  Even  before  the  vacuum  drop? 

L142:  I  would  think  yes. 

L143:  If  that  was  the  case,  vacuum  would  be  secondary 

LI 44:  because  you  would  have  other  problems.  On  our  ship  we  have  other 
things. 

L145:  You  could...  You  have  deck  return  coming  into  the  main  unit.  But 
that  should  not  be... 

L146:  Say  at  full  speed 

L147 :  it  would  not  be  making  any  effect.  I  don't  believe  from  other.... 
L148:  On  a  vacuum,  you  basically  looking  for  a  leak 
L149:  getting  into  the  main  unit  somehow. 

L150:  The  turbine...  that  is  basically  the  problem. 

E151:  how  about  the  deaerator?  If  there  is  a  hole  in  the  deaerator. 

L151 :  What  type  of  leak? 

L152:  There  is  a  pressure  on  the  deaerator. 

L153:  Deaerator  is  under  35  lb  pressure  aha...  goes  into  the  feedpump 
L154:  so  I  don't  think  there  is  any  problem  there. 

L155:  If  the  level  of  the  deaerator  changes, 

L156:  you'll  notice  if  the  makeup  for  the  dump  fail  for  the  deaerator, 
L157 :  and  you'll  notice  the  level  change. 

L158:  I  don't  believe  it  will  affect  the  vacuum  at  all. 

E159:  And  surely  the  vacuum  pump  itself  can  fail  that  will  cause  the 
vacuum  to  drop,  but  it  should  not  happen  when  slowing  down. 


Failure  23  (When  raising  to  normal  full  speed,  boiler  pressures  drop 
and  smoking  in  both  boilers.  Low  atomizing  steam  alarms 
sound.)  (S32) 

L200:  First  we  need  to  find  out  what's  causing  the  boiler  pressure  to 
drop. 

L201:  Check  fuel  oil  pressure.  You  said  it  is  OK. 

L202:  And  you  said  smoking. 

L203:  That'll  probably  indicate  the  pressure  would  be  extremely  high 

L204:  because  it  is  a  large  load  from  the  boiler. 

L205:  So  that  could  cause  the  smoke. 

L206:  It  would  be  too  much  demand  for  the  boiler  to  make  up. 

L207 :  That's  the  reason  the  boiler  pressure  is  dropping. 

L208:  With  low  atomizing  steam  we  have  to  check  the  130  lb  system  to  see 
if  that  is  normal. 

L209:  If  that's  normal,  we  have  to  go  to  the  atomizing  steam  regulator 
to  see  that  is  functioning  properly. 

L210:  If  that  fails,  you  have  poor  atomization  which  undoubtably  will 
trip  the  boiler. 

E211:  Cause  the  smoke? 

L212:  Surely,  because  you  have  low  atomizing  steam, 

L213:  and  you  would  have  to  slow  down  to  bring  the  pressure  back  so  you 
don't  lose  the  plant. 

L214:  Doing  that  will  bring  up  the  fuel  oil  pressure  back. 

L215:  Remedy  the  atomizing  steam  somewhat  or  bypass  it  and  find  out  the 
fault.  That  seems  to  be  the  problem. 

L216:  What  if  the  atomizing  steam  pressure  alarm  doesn't  go  off. 

L217:  Need  to  find  out  whether  there  is  too  much  demand  for  the  boiler, 

L218:  whether  the  boiler  pressure  is  dropping  in  the  auxiliary  system  or 
the  main  system. 

L219:  In  order  to  bring  the  pressure  back,  you  have  to  reduce  the  load. 

L220:  We'll  have  to  come  back  to  a  reduced  speed 

L221:  for  sure  to  secure  some  other  steam  demand  and  find  out  the  prob¬ 
lem. 

E222:  What's  the  procedure  to  find  out  the  problem? 

L223 :  You  have  to  check  to  see  if  both  burners  in  both  boilers  are 
operating  properly. 

L224:  If  the  fuel  oil  is  following  the  demand  and  the  air, 

L225:  if  there  is  not  enough  air  we  won't  get  proper  atomization.  I 
mean  proper  burning.  It  could  be. 

L226:  Have  to  check  the  different  pressure. 

L227 :  Whether  the  temperature  of  the  boiler  is  OK 

L228:  to  >ee  if  it  would  have  dirty  tubes. 

E229:  Blockage? 

L230:  Anything  could  be  blocked. 

L231:  It  could  be  having  extremely  cold  water  going  into  the  boiler. 

L232:  So  the  economizer  may  be  the  problem? 

L233:  Possibly  could  be  economizer  plugged  up,  or... 

E234:  Remember  we  have  smoke.  Does  it  help  to  narrow  down  the  possibil¬ 
ities? 

L233:  Supossing  the  atomizing  steam  is  proper? 

E236:  Yes. 


L237:  Not  enough  air. 

L238:  Because  if  it  is  not  enough  air,  fuel  oil  temperature  is  too  low, 
L239:  it  will  be  smoke. 

E240:  You  assume  black  smoke? 

L241 :  I  like  to  say  it  is  too  much  demand  whether  it  is  enough  heat 
transfer  for  the  fire  to  the  burner. 

L242:  Or  fuel  oil  pressure  too  high  then  you  have  smoke  because  too  much 
demand . 

E243:  Too  much  supply? 

L244:  Yes,  oh...  I  guess,  oh...  too  much  demand  on  the  boiler. 

L245:  It  will  try  to  make  up  more  fuel. 

L246:  More  fuel  will....  so  you  exceed  the  air, 

L247 :  not  enough  to  burn  them,  so  you  see  it  black. 

E248:  If  only  boiler  pressure  drops  and  black  smoke  are  noticed,  What  is 
the  most  possible  reason? 

L249:  Not  enough  air  I  think. 

E250:  How  does  the  air  affect  the  boiler  pressure? 

L251:  Boiler  pressure?  Well,  if  the  fuel  follows  the  air  it  will  drop. 
L252:  If  the  fuel  is  following  the  air,  you  shouldn't  have  black  smoke. 
However,  we  have  black  smoke  here. 

L253:  The  combustion  control,  the  air  damper  must  not  be  open  right. 
L254:  The  speed  of  the  forced  draft  fan  might  not  be  up  speeded. 

L255:  Or...  I  mean  it  is  a  process  of  elimination. 

L256:  I  mean  it  is  hard  to  sit  here  and  tell  you  what's  wrong. 

E257 :  But  you  can  assume,  see,  suppose  the  combustion  control  is  not 
functioning  well,  the  ratio  is  not  right. 

L258:  You  don't  have  enough  air. 

L259:  You  not  going  to  burn  right. 

L260:  And  you  won't  have  the  right  amount  of  mixture  for  the  air  and 
fuel  to  give  you  proper  burning. 

L261:  So  you  going  to  lose  pressure. 

E262:  Does  it  happen  on  any  condition? 

L263:  It  really  doesn't  matter. 

L264:  If  you  can't  get  the  right  fire,  you  not  going  to  get  pressure. 
L263:  I  mean  unless  there  is  no  demand.  You  going  to  get  the  pressure 
drop. 
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How  does  the  power  plant  work?  (S31) 


L301:  A  boiler  would  generate  the  steam, 

L302:  the  steam  leave  the  boiler  at  a  predetermined  pressure  and  tem¬ 
perature. 

L303:  It  leaves  the  boiler  at  different  levels  at  different  outlets,  \ 

L304:  e.g.  superheated  steam,  desuperheated  steam,  each  for  different 
purposes. 

L305:  The  superheated  steam  goes  into  the  throttles  for  the  main  tur¬ 
bine.  ' 

L306:  That  adjusts  the  direction  or  the  amount  of  steam  goes  it  tur¬ 
bines.  ' 

L307 :  Turbines  use  the  heat  and  steam  to  get  the  work  out. 

L308:  And  after  it  is  used  up  the  thermo  energy, 

L309:  it  is  cool  and  changed  back  from  steam  to  water  back  in  the  con¬ 
denser. 

L310:  The  condensate  is  pumped  from  within  the  air  ejector  up  into  the 

dc  heater.  « 

L311:  Dc  heater  will  heat  it  and  the  deaerator  take  the  air  out  of  it. 

L312:  You  don't  want  the  air  to  enter  the  boiler  system. 

L313:  From  there  it  goes  into  the  feedpump. 

L314:  And  return  to  the  boiler  and  heated  in  different  stages. 


L315 : 
L316: 

So  it  is  a  closed  loop  per  se. 

Then  you  have  all  the  auxiliaries  which  operate  out 

of 

different 

L317: 

L318: 

pressure  steam  to  maintain  the  powerplant. 

We  have  35  lb  line  which  is  used  for  gland  seal. 

For  the  deaerator  you  have  the  150  lb  line  which  is 

used 

for 

the 

L319: 

L320: 

make  up. 

150  lb  line  for  the  fuel  oil  heater 

and  regular  steam  which  is  desuperheated  steam  (600 

lb) 

is 

used 

for  the  feedpump. 
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How  does  gland  seal  work?  (S31) 


L401 : 
L402 : 

L403 : 

L404: 

L405: 

L406 : 

L407 : 

L408: 
L409: 
L410: 
L411 : 
L412: 

L413 : 

L414: 

E415: 

L416 : 
L417 : 

L418: 

L419: 
L420: 
L421 : 

L422: 

L423: 

L424: 


What  gland  seal  is  trying  to  do  is  to  prevent  air  from  leaking 
from  outside  of  the  casing  into  the  casing  eventually  dump  into 
the  vacuum. 

What  it  does  is  to  seal  off  the  air  which  would  be  sucked  in  oth¬ 
erwise 

because  there  is  a  difference  in  pressure  between  the  vacuum  and 
the  atmosphere. 

Sr>  under  full  speed  with  steam  admitting  into  the  turbine  at  a 
high  enough  pressure 

when  the  steam  is  trying  to  escape,  you  don't  need  gland  seal 
because  there  is  pressure  within  the  casing. 

So  what  you  do  is  keep  that  steam  in  that  casing,  so  air  can't 
come  in. 

So  the  gland  seal  prevents  the  air  you  don't  want  in  the  system. 

In  reduced  speed  you  would  need  the  gland  seal. 

You  have  to  be  able  to  maintain  the  vacuum. 

It  is  created  with  the  steam  changing  back. 

The  volume  of  steam  going  back  to  volume  of  water  which  creates 
the  vacuum. 

Also  it  keeps  the  temp,  e.g.  if  you  have  air  sucking  in  or.  the 
rotor,  it  is  not  a  good  condition. 

It  may  have  a  tendency  to  bolt  the  rotor  or  something  to  that 
effect  if  it  is  in  a  standing  position. 

Do  you  have  any  experience  when  you  were  on  a  ship,  some  problem 
happened  that  reminded  you  of  some  other  symptoms. 

You  don't  always  remember  what  happened. 

I  may  not  remember  the  exact  cause  of  the  effect.  But  I  may  be 
able  to .... 

just  by  experience  in  my  own  mind  that  I  know  to  check  this  one 
thing, 

a  direction,  which  I  believe  is  the  definition  of  experience. 

Say,  if  I'm  at  full  speed,  and  vacuum  is  coming  down. 

I'll  look  the  level  of  the  condenser  because  7,  10  years  ago  I  was 
a  third,  we  got  a  departure, 

this  happened  to  me,  I  didn't  know  waht  to  do  at  that  time. 

But  down  the  line,  this  did  happen  to  me  again. 

But  now  it  is  such  a  common  thing  to  me.  Of  course,  the  experi¬ 
ence  I  never  forget. 


Failure  8  (Vibration,  noise  and  speed  fluctuation  occurred  in  the  no.  1 
feedpump.  When  the  engineer  started  the  no.  2  pump,  the  same 
symptoms  occurred  immediately.)  (S34) 


L450:  Well,  maybe  loss  of  suction  because  that  is  common  to  both  pumps. 
L451:  Check  the  level  of  the  deaerator  (normal). 

L452:  I  have  checked  out  the  suction  side  of  the  water  side  of  both 
pumps . 

L453 :  Now  1  will  check  out  discharge  side  of  both  pumps. 

L454:  Check  out  the  condition  of  the  condensate  pump  system.  This  will 
give  me  what  valves  are  open. 

L455:  OK,  next  one  on  the  water  side . HP  heater  (normal). 

L456:  What  will  cause  fluctuation.  Maybe  there  is  .... 

L457 :  I  will  go  to  my  steam  system. 

L458:  Now  I  have  eliminate  the  suction  water  side  of  the  pump.  I  am  up 
to  the  economizer. 

L459:  Check  temp  gauges  coming  in  and  out  of  economizer. 

L460:  I  wonder  if  there  is  a  leakage  in  the  economizer? 

L461:  Maybe  the  economizer  is  leaking. 

L462:  Sometimes  when  you  started  up  a  pump  it  will  cause  speed  fluctua¬ 
tion  and  trip  out. 

L463:  Inlet  is  normal;  check  the  outlet. 

L464:  If  outlet  is  low,  then  that  means  the  economizer  is  plugged  up. 
L465:  Normal.  So  every  thing  on  the  water  side  is  OK. 

L466:  Now  I  am  going  to  the  steam  side  of  the  feedpump. 

L467:  Next  thing  I  am  looking  for  is  what  is  common  to  both  pumps. 

L468:  Check  if  35  lb  line  has  back  pressure.... 

L469:  Check  if  35  lb  pressure  is  normal  (normal). 

L470:  If  that  is  normal,  back  pressure  cannot  be  low. 

L471:  I  will  be  getting....  Pump  would  not  be  fluctuated. 

L472:  Now  go  on  to  the  steam  side  of  the  inlet. 

L473:  I  cannot  be  nothing  that  is  common... 

L474:  I  am  going  to  be  backtracking. 

L475:  Desuperheated  steam  for  my  feedpump.... 

L476:  Check  if  both  of  them  come  out  right. 

L477 :  I  want  to  find  a  cause  common  to  both  feedpumps. 

L478:  I  have  checked  my  pressure  and  levels. 

L479:  I  wonder  about  if  the  temp  of  the  steam  is  relevant... 

L480:  If  cold  water  goes  into  the  feedpump, 

L481:  It  would  cause  it  to  flash  the  steam 
L482:  And  the  pumps  would  lose  suction. 

L483:  A  lot  of  times  when  we  have  problems  with  our  deaerator 
L484:  And  we  have  to  bypass  the  deaerator. 

L485:  We  take  water  from  our  distilled  tank. 

L486:  The  water  in  the  deaerator  is  275  degrees, 

L487 :  And  the  water  coming  from  the  distilled  tank  is  90  degrees. 

L488:  Difference  in  change  will  cause  it  to  flash. 

L489:  And  the  pump  will  lose  suction. 

L490:  I  am  going  to  check  out  the  temp  of  the  water  coming  from  the 
deaerator. 


L491:  I  am  going  to  condensate  system  again. 

L492:  Sometimes  it  is  lube  oil,  I  may  lose  lube  oil. 

L493:  Nothing  there,  what  else  can  cause  it? 

L494:  Maybe  the  lube  oil  cooler  (pressure  of  salt  water). 

L495:  It  says  OK. 

L496:  The  only  thing  I  can  go  on  is  my  condensate  system  and  steam  sys¬ 
tem. 

L497 :  We  have  eliminated  condensate  system  because  water  is  at  normal 
pressure,  tern  and  level. 

L498:  The  economizer  says  OK. 

L499:  .  Steam  temp  may  be  a  problem. 

L500:  20  degrees  low  coming  out  of  the  desuperheater. 

L501:  Temp  of  steam  could  cause  fluctuation  of  the  feedpump 
L502:  Because  the  higher  the  temp  the  more  the  work  you  get  out  of  the 
steam. 

L503:  So  if  it  is  20  degrees  low,  it  got  to  be  something  wrong  in  my 
desuperheater. 

L504:  I  am  now  backtracking....  coming  out  of  the  superheater 
L505:  Going  into  the  desuperheater. 

L506:  Backtrack  a  little  bit  further. 

L507:  Come  out  of  the  drum  going  to  my  superheater. 

L508:  I  am  going  to  check  my  drum  level  (normal). 

L509:  So,  back  into  my  superheater. 

L510:  It  has  to  be  the  attemperator ,  the  desuperheater  or  the 
superheater. 

(At  this  point  S33  found  the  solution  -  desuperheater  failure) 

E511:  Why  the  pressure  is  normal  but  the  temp  is  abnormal? 

L512:  Maybe  there  is  a  leak  in  the  desuperheater. 

L513:  That  causes  loss  of  chemicals. 

L314:  Temp  will  be  lower. 

L513:  If  I  have  a  leak  it  will  go  into  my  drum. 

L516:  My  flow  from  the  desuperheater  outlet  will  be  slower, 

L317:  And  it  would  be  cooling  down  too  much. 


Failure  9  (White  haze  is  seen  on  STBD  boiler  periscope)  (S33) 


L350:  White  haze  means  too  much  air. 

L551:  Let  us  check  the  STBD  burners'  management  system.  It  says  it  is 
running . 

L552:  Look  at  the  air  system.  Too  much  air  may  be  the  problem. 

L553:  So  burners'  management  sytem  is  OK.  Let  us  start  with  the  fan  and 
work  our  way  down. 

L554:  I  got  pressure  to  the  boiler  out  of  the  fan. 

L555:  Signals  go  into  the  combustion  control  of  that  fan, 

L556:  that  regulates  the  fan. 

L557 :  Maybe  the  combustion  control  is  sending  a  wrong  signal  to  the  top 
of  the  diaphragm. 

L558:  Or  the  diaphragm  is  broken. 

L559:  If  it  fails  open  then  I  have  more  air  than  I  need. 

L560:  So  check  the  air  pressure. 

L561:  It  is  high. 

L562:  The  fan  is  OK  since  it  is  running.  Maybe  the  crossover .  nor¬ 

mal. 

L363:  Go  down  the  line  see  if  I  can  pick  up  something  else. 

L364:  Check  windbox  down  the  line.  It  says  high .  but  crossover  is 

normal . 

L363:  So  nothing  is  wrong  along  this  line.  FD  fan  is  running. 

L366:  Maybe  it  has  to  be  slow.  Well,  OK. 

L567 :  I  will  say  maybe  my  fuel  oil  has  problem. 

L368:  My  damper  is  staying  normal  but  something  happens  to  my  fuel  oil. 
L369:  Something  has  to  stick. 

L570:  Crossover  is  OK,  inlet  damper  is  allright,  outlet  is  lower  all  the 
way  down  my  line. 

L371:  Unless  I  have  a  dirty  burner.... 

L372:  But  that  will  give  out  black  smoke. 

L373:  So  I  have  eliminated  my  burner.  My  burner  is  OK. 

L374:  Let  us  check  out  the  fuel. 

L373:  Let  us  check  out  the  temp  of  the  fuel. 

L376:  Temp  is  normal,  pressure  is  slightly  high. 

L377:  If  that  is  high....  my  air  is  high  too. 

L378:  One  supposes  to  follow  the  other. 

L379:  That  is  a  part  of  the  burners'  management. 

L380:  Maybe  something  stuck  along  the  fuel  oil  line. 

L381:  Fuel  oil  may  then  be  slightly  high. 

L382:  Combustion  control  sends  the  signal  to  my  air  system. 

L383:  So,  let  us  check  the  regulating  valve,  check  the  flow  before  it. 
L384:  Slightly  high....  I  am  going  further  up  the  fuel  oil  system. 

L383:  Fuel  oil  temp  is  OK,  pressure  is  high. 

L386:  Check  the  strainer...  no,  that  cannot  be  it. 

L387 :  Because  that  should  cause  pressure  to  go  low. 

L388:  Go  back  to  the  fuel  oil  pipe. 

L389:  Check  the  fuel  oil  pressure  regulator. 

L390:  Normal....  Go  back  further. 

L391:  Why  the  pressure  is  high? 
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L592:  Something  wrong  in  the  pump?  No,  it  cannot  be  because  it  would  be 
low  too. 

L593:  If  the  relief  valve  is  open.... 

L594:  It  would  be  low  too. 

L595:  Go  back  to  my  burner  again. 

L596:  Maybe  my  burner  is  clogged. 

L597 :  But  that  would  be  black  smoke. 

L598:  So . temp  is  normal...... 

L599:  I  am  just  going  to  say  maybe  it  runs  out  another  pump. 

L600:  I  will  check  out  the  fuel  oil  pump. 

L601:  Pump  is  OK. 

L602:  Maybe  my  damper  open  too  much. 

L603 :  Check  the  other  boilr  see  if  pressure  going  in  is  high....  normal. 

L604:  ...... .slightly  high  fuel  oil  pressure  and  too  much  air.... 

L605.  I  am  stuck. 


SYMBOLS 


PRESSURE  INDICATOR 
TEMPERATURE  INDICATOR 
LEVEL  INDICATOR 
FLOW  INDICATOR 
SMOKE  INDICATOR 
VALVE 

ANGLE  VALVE 

STRAINER 

PUMP 

LEVEL  TRANSMITTER 
PRESSURE  TRANSMITTER 
TEMPERATURE  TRANSMITTER 
FLOW  TRANSMITTER  * 
MOTOR  OPERATED  VALVE 
SAFETY  VALVE 
RELIEF  VALVE 
AIR  OPERATED  VALVE 


AIR  ACTUATOR 


fUEl  OH 
UM»»  CONI 


L  SUPPLY  SYSTEM 


MAIN  STEAM,  AUXILIARY  STEAM,  AND 
CIRCULATING  WATER  SYSTEMS 


; 


r 


» 


CC  z 


LUBRICATING  OIL  SYSTEM 


IUHBO  GEN  l  0 
com  ER 
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SALTWATER  SERVICE  SYSTEM 


10  IP  IXfHACIlON  RICULAION 


CONTROL  AIR  SYSTEM 
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Marine  Powerplant  Troubleshooting 


You  are  about  to  participate  in  an  exercise  where  a  marine 
powerplant  is  simulated  on  an  Apple  personal  computer.  The  computer 
will  display  symptoms  resulting  from  a  realistic  failure  in  a 
component  or  a  subsystem  of  a  marine  powerplant.  The  symptoms 
correspond  to  the  failure  of  a  single  component  or  a  subsystem,  or  a 
single  identifiable  cause  such  as  contaminated  water.  In  addition  to 
the  computer  display,  you  are  provided  with  a  set  of  schematic 
diagrams  and  a  list  of  gauges  and  other  status  indicators.  The 
schematic  diagrams,  gauges,  and  status  indicators  correspond  to  the 
engine  room  simulator  at  MSI. 

You  can  check  the  status  of  the  gauges  and  displays  by 
interrogating  the  computer  using  a  menu  of  commands.  These  commands 
enable  you  to  check  individual  components,  group  of  annunciators, 
diagnose  and  declare  the  failed  component,  or  quit.  A  number  of  trial 
runs  will  be  used  to  familiarize  you  with  the  simulation. 


Each  run  incorporates  a  single  failure.  Status  of  relevant 
displays  is  stored  in  the  computer  for  each  failure.  To  check  the 
displays  corresponding  to  a  subsystem,  you  must  identify  that  system 
by  its  unique  identification  number.  These  numbers  are  indicated  in 
the  appropriate  locations  on  the  schematic  diagrams. 


Even  though  a  failed  component  may  result  in  continuous  change  in 
status  in  real  life  systems,  the  status  of  displays  in  our  simulator 
has  been  frozen  at  a  certain  arbitrary  point  in  time  after  the  failure 
has  occurred.  Hence,  you  get  a  "snapshot"  view  of  displays  after  the 
failure  has  occurred. 


You  may  check  as  many  gauges  and  displays  as  you  want  before 
identifying  the  failed  component.  However,  you  should  try  to  keep  the 
number  of  checks  and  readings  to  a  minimum  since  in  real  life  each 
reading  would  incur  some  cost.  You  are  expected  to  identify  the  cause 
of  the  failure  within  the  allotted  time.  Preferably,  you  should 
identify  the  cause  as  quickly  as  possible. 


A  number  of  training  runs  will  be  provided  so  that  the  exercise 
procedure  becomes  clear.  By  the  end  of  these  trials,  you  will  be 
familiar  with  the  command  options  on  the  computer,  and  the  schematic 
diagrams.  Please  feel  free  to  ask  questions  during  any  part  of  the 
trial  runs. 


Please  do  not  discuss  this  exercise  with  any  other  trainee. 
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Appendix  F 
Command  List 


COMMAND  LIST 


COMMAND  GAUGE 


COMAND  GAUGE 
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Boiler  Gauges 


STBD. 

PORT 

Pressures 

G1 

G10 

Forced  Draft  Fan  Discharge 
Pressure 

G2 

Gil 

Wind  Box  Pressure 

G3 

G12 

Furnace  Pressure 

G4 

G13 

Economizer  Inlet  Pressure 
(Gas  Side) 

G5 

G14 

Uptake  Pressure 

G6 

G15 

Fuel  Oil  to  Burners  Pressure 

G7 

G16 

Boiler  Drum  Pressure 

G8 

G17 

Superheated  Steam  Pressure 

G9 

G18 

Desuperheated  Steam  Pressure 

Temperatures 

G41 

G47 

Fuel  Oil  Temperature 

G42 

G48 

Air  Heater  Outlet  Temperature 

G43 

G49 

Economizer  Inlet  Temperature 

G44 

G50 

Economizer  Outlet  Temperature 

G45 

G51 

Desuperheated  Steam  Temperature 

G46 

G52 

Superheated  Steam  Temperature 

Levels 

G56 

G57 

Boiler  Drum  Level 

G62 

G63 

Smoke  Indicators 

Flow 

G64 

G65 

Fuel  Oil  Flow 

BOTH 

G38 

Control  Air  Pressure 

G39 

Fuel  Supply  Header  Pressure 

G40 

Atomizing  Steam  Pressure 

G66 

Fuel  Oil  Settling  Tank  Level 

Transducer  and  Transmitter  Gauges 

SUPPLY  LOADING 


G19 

G20 

G21 

G22 

G23 

G24 

G25 

G26 

G27 

G28 

G29 

G30 

G31 

G32 

G33 

G34 

G35 

G36 

G37 

G94 


G53 

G55 

G95 

G96 


G54 

G58 

G59 

G60 

G61 

G67 


G68 

G69 

H.P.  Extraction  Regulator 

COMMAND 

G70 

G71 

I.P.  Extract ion  Regu 1 ator 

G72 

G73 

875/150  PSIG  Reducing  Station 

11 

G74 

G75 

875/35  PSIG  Reducing  Station 

12 

G76 

G77 

Make-up  Feed  Regulator 

13 

G78 

G79 

Condensate  Recirculator 

14 

Regulator 

15 

G80 

G81 

Deaerating  Feed  Tank  Dump 

16 

Regulator 

17 

G82 

G83 

35  P  IC  System  Dump  Regulator 

18 

G84 

G85 

Atomizing  Steam  Regulator 

19 

G86 

G87 

Stbd.  Boiler  Steam  Flow 

110 

Transmitter 

Ill 

G88 

G89 

Stbd.  Boiler  Drum  Level 

112 

Transmitter 

G90 

G91 

Port  Boiler  Steam  Flow 

Transmitter 

G92 

G93 

Port  Boiler  Drum  Level 

Transmitter 


Other  Engine  Room  Pressures 
Deaerator  Pressure 
Feedwater  Header  Pressure 
Throttle  Hydraulic  Oil 
Pressure 

150  PSIG  System  Pressure 
35  PSIG  System  Pressure 

H. P.  Extraction  Pressure 

I. P.  Extraction  Pressure 
L.P.  Extraction  Pressure 
Condenser  Vacuum 

Throttle  Supply  Steam  Pressure 
Ahead  Turbine  Pressure 
Astern  Turbine  Pressure 
Gland  Seal  Steam  Pressure 
Lube  Oil  Header  Pressure 
Main  Engine  Lube  Oil  Pressure 
Salt  Water  Service  System 
Pressure 

Main  Cooling  Water  Pressure 
Condensate  Pump  Pressure 
Turbo  Generator  Lube  Oil 
Pressure 

Main  Cooling  Water  Disch. 

Press . 

Other  Engine  Room  Temperatures 

Feedpump  Lube  Oil 

L.P.  Turbine  Exhaust 

Main  Cooling  Water 

Main  Cooling  Water  Discharge 

Other  Engine  Room  Levels 
Lube  Oil  Sump  Tank  Level 
Deaerator  Tank  Level 
Lube  Oil  Gravity  Tank  Level 
Main  Condenser  Level 
Atmospheric  Drain  Tank  Level 
Distillate  Tank  Level 


STATUS  INDICATOR  PAGES 

Throttle  Control 
Main  Turbine  Bearing  Temps 
Stbd.  Boiler  Burner  Mgm. 
Port  Boiler  Burner  Mgm. 
Salinity 

SSTG  Bearing  Temps 
Bleed  Status 
Lube  Oil  System 
Condensor  System 
Turbine  Viax  Panel 
Fuel  Oil  System 
Feedpump  Status 
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